1
|
Fisher AA, Gonzalez LS, Cappel ZR, Grover KE, Waclaw RR, Robinson JE. Dopaminergic encoding of future defensive actions in the mouse nucleus accumbens. PNAS NEXUS 2025; 4:pgaf128. [PMID: 40321418 PMCID: PMC12046218 DOI: 10.1093/pnasnexus/pgaf128] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/04/2024] [Accepted: 04/04/2025] [Indexed: 05/08/2025]
Abstract
Dopamine release in the nucleus accumbens (NAc) plays a critical role in the motivation to perform actions that promote survival. However, the NAc dopamine response to innately threatening visual stimuli, such as predators descending from above, and the innate behaviors they promote has not been fully characterized. Using the genetically encoded sensor dLight1, we investigated looming visual threat-evoked dopamine release in the lateral (LNAc) and medial NAc shell (NAcS) regions in freely moving mice during performance of a looming stimulus assay. We found that dopamine release related to visual threat perception in the NAcS, but not in the LNAc, predicts the timing and vigor of a future defensive action, yet dopamine released during the performance of the action itself does not. Optogenetic inhibition of dopaminergic terminals in the NAcS at visual stimulus onset prevented escape, confirming a role for ventral striatal dopamine in promoting threat-related behaviors.
Collapse
Affiliation(s)
- Austen A Fisher
- Division of Experimental Hematology and Cancer Biology, Department of Pediatrics, Cincinnati Children's Hospital Medical Center, Cincinnati, OH 45229, USA
- Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, OH 45229, USA
| | - L Sofia Gonzalez
- Division of Experimental Hematology and Cancer Biology, Department of Pediatrics, Cincinnati Children's Hospital Medical Center, Cincinnati, OH 45229, USA
- Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, OH 45229, USA
- Neuroscience Graduate Program, University of Cincinnati College of Medicine, Cincinnati, OH 45267, USA
| | - Zoe R Cappel
- Division of Experimental Hematology and Cancer Biology, Department of Pediatrics, Cincinnati Children's Hospital Medical Center, Cincinnati, OH 45229, USA
- Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, OH 45229, USA
- Neuroscience Graduate Program, University of Cincinnati College of Medicine, Cincinnati, OH 45267, USA
| | - Kassidy E Grover
- Division of Experimental Hematology and Cancer Biology, Department of Pediatrics, Cincinnati Children's Hospital Medical Center, Cincinnati, OH 45229, USA
- Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, OH 45229, USA
- Neuroscience Graduate Program, University of Cincinnati College of Medicine, Cincinnati, OH 45267, USA
| | - Ronald R Waclaw
- Division of Experimental Hematology and Cancer Biology, Department of Pediatrics, Cincinnati Children's Hospital Medical Center, Cincinnati, OH 45229, USA
- Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, OH 45229, USA
| | - J Elliott Robinson
- Division of Experimental Hematology and Cancer Biology, Department of Pediatrics, Cincinnati Children's Hospital Medical Center, Cincinnati, OH 45229, USA
- Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, OH 45229, USA
| |
Collapse
|
2
|
Kahnt T, Schoenbaum G. The curious case of dopaminergic prediction errors and learning associative information beyond value. Nat Rev Neurosci 2025; 26:169-178. [PMID: 39779974 DOI: 10.1038/s41583-024-00898-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/11/2024] [Indexed: 01/11/2025]
Abstract
Transient changes in the firing of midbrain dopamine neurons have been closely tied to the unidimensional value-based prediction error contained in temporal difference reinforcement learning models. However, whereas an abundance of work has now shown how well dopamine responses conform to the predictions of this hypothesis, far fewer studies have challenged its implicit assumption that dopamine is not involved in learning value-neutral features of reward. Here, we review studies in rats and humans that put this assumption to the test, and which suggest that dopamine transients provide a much richer signal that incorporates information that goes beyond integrated value.
Collapse
Affiliation(s)
- Thorsten Kahnt
- Intramural Research Program, National Institute on Drug Abuse, Baltimore, MD, USA.
| | - Geoffrey Schoenbaum
- Intramural Research Program, National Institute on Drug Abuse, Baltimore, MD, USA.
| |
Collapse
|
3
|
Fry BR, Russell N, Fex V, Mo B, Pence N, Beatty JA, Manfredsson FP, Toth BA, Burgess CR, Gershman S, Johnson AW. Devaluing memories of reward: a case for dopamine. Commun Biol 2025; 8:161. [PMID: 39900665 PMCID: PMC11790953 DOI: 10.1038/s42003-024-07440-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2024] [Accepted: 12/23/2024] [Indexed: 02/05/2025] Open
Abstract
Midbrain dopamine cells encode differences in predictive and expected value to support learning through reward prediction error. Recent findings have questioned whether reward prediction error can fully account for dopamine function and suggest a more complex role for dopamine in encoding detailed features of the reward environment. In this series of studies, we describe a novel role for dopamine in devaluing sensory features of reward. Mesencephalic dopamine cells activated during a mediated devaluation phase were later chemogenetically reactivated. This retrieval of the devalued reward memory elicited a reduction in the hedonic evaluation of sucrose reward. Through optogenetic and chemogenetic manipulations, we confirm dopamine cells are both sufficient and necessary for mediated devaluation, and retrieval of these memories reflected dopamine release in the nucleus accumbens. Consistent with our computational modeling data, our findings indicate a critical role for dopamine in encoding predictive representations of the sensory features of reinforcement. Overall, we elucidate a novel role for dopamine function in mediated devaluation and illuminate a more elaborate framework through which dopamine encodes reinforcement signals.
Collapse
Affiliation(s)
- Benjamin R Fry
- Department of Psychology, Michigan State University, East Lansing, MI, USA
- Department of Biobehavioral Health, The Pennsylvania State University, University Park, PA, USA
| | - Nicolette Russell
- Department of Psychology, Michigan State University, East Lansing, MI, USA
| | - Victoria Fex
- Lyman Briggs College, Michigan State University, East Lansing, MI, USA
| | - Bing Mo
- Department of Psychology, Michigan State University, East Lansing, MI, USA
| | - Nathan Pence
- Department of Psychology, Michigan State University, East Lansing, MI, USA
| | - Joseph A Beatty
- Department of Physiology, Michigan State University, East Lansing, MI, USA
- Neuroscience Program, Michigan State University, East Lansing, MI, USA
| | - Fredric P Manfredsson
- Department of Translational Neuroscience, Barrow Neurological Institute, Phoenix, AZ, USA
| | - Brandon A Toth
- Michigan Neuroscience Institute, University of Michigan, Ann Arbor, MI, USA
| | | | - Samuel Gershman
- Department of Psychology, Harvard University, Cambridge, MA, USA
| | - Alexander W Johnson
- Department of Psychology, Michigan State University, East Lansing, MI, USA.
- Neuroscience Program, Michigan State University, East Lansing, MI, USA.
| |
Collapse
|
4
|
Peters J. A neurocomputational account of multi-line electronic gambling machines. Trends Cogn Sci 2025:S1364-6613(24)00330-9. [PMID: 39818443 DOI: 10.1016/j.tics.2024.12.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2023] [Revised: 12/13/2024] [Accepted: 12/16/2024] [Indexed: 01/18/2025]
Abstract
Multi-line electronic gambling machines (EGMs) are strongly associated with problem gambling. Dopamine (DA) plays a central role in substance-use disorders, which share clinical and behavioral features with disordered gambling. The structural design features of multi-line EGMs likely lead to the elicitation of various dopaminergic effects within their nested anticipation-outcome structure. The present account draws an analogy between EGM gambling and latent state inference accounts of conditioning, and links maladaptive gambling-related beliefs and expectancies to a process of erroneous latent state inference that may be exacerbated by EGM design features and associated dopaminergic processes. Over the course of repeated exposure to gambling, these processes may foster the emergence of maladaptive state priors, which clinically manifest as gambling-related cognitions, beliefs, and expectancies.
Collapse
Affiliation(s)
- J Peters
- Department of Psychology, Biological Psychology, University of Cologne, Cologne, Germany.
| |
Collapse
|
5
|
Takahashi YK, Zhang Z, Kahnt T, Schoenbaum G. Dopaminergic responses to identity prediction errors depend differently on the orbitofrontal cortex and hippocampus. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.12.11.628003. [PMID: 39763911 PMCID: PMC11702580 DOI: 10.1101/2024.12.11.628003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/14/2025]
Abstract
Adaptive behavior depends on the ability to predict specific events, particularly those related to rewards. Armed with such associative information, we can infer the current value of predicted rewards based on changing circumstances and desires. To support this ability, neural systems must represent both the value and identity of predicted rewards, and these representations must be updated when they change. Here we tested whether prediction error signaling of dopamine neurons depends on two areas known to represent the specifics of rewarding events, the HC and OFC. We monitored the spiking activity of dopamine neurons in rat VTA during changes in the number or flavor of expected rewards designed to induce errors in the prediction of reward value or reward identity, respectively. In control animals, dopamine neurons registered both error types, transiently increasing firing to additional drops of reward or changes in reward flavor. These canonical firing signatures of value and identity prediction errors were significantly disrupted in rats with ipsilateral neurotoxic lesions of either HC or OFC. Specifically, HC lesions caused a failure to register either type of prediction error, whereas OFC lesions caused persistent signaling of identity prediction errors and much more subtle effects on signaling of value errors. These results demonstrate that HC and OFC contribute distinct types of information to the computation of prediction errors signaled by dopaminergic neurons.
Collapse
Affiliation(s)
- Yuji K Takahashi
- Intramural Research Program, National Institute on Drug Abuse, Baltimore, MD
| | - Zhewei Zhang
- Intramural Research Program, National Institute on Drug Abuse, Baltimore, MD
| | - Thorsten Kahnt
- Intramural Research Program, National Institute on Drug Abuse, Baltimore, MD
| | - Geoffrey Schoenbaum
- Intramural Research Program, National Institute on Drug Abuse, Baltimore, MD
| |
Collapse
|
6
|
Carvalho W, Tomov MS, de Cothi W, Barry C, Gershman SJ. Predictive Representations: Building Blocks of Intelligence. Neural Comput 2024; 36:2225-2298. [PMID: 39212963 DOI: 10.1162/neco_a_01705] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2024] [Accepted: 06/10/2024] [Indexed: 09/04/2024]
Abstract
Adaptive behavior often requires predicting future events. The theory of reinforcement learning prescribes what kinds of predictive representations are useful and how to compute them. This review integrates these theoretical ideas with work on cognition and neuroscience. We pay special attention to the successor representation and its generalizations, which have been widely applied as both engineering tools and models of brain function. This convergence suggests that particular kinds of predictive representations may function as versatile building blocks of intelligence.
Collapse
Affiliation(s)
- Wilka Carvalho
- Kempner Institute for the Study of Natural and Artificial Intelligence, Harvard University, Cambridge, MA 02134, U.S.A.
| | - Momchil S Tomov
- Department of Psychology and Center for Brain Science, Harvard University, Cambridge, MA 02134, U.S.A
- Motional AD LLC, Boston, MA 02210, U.S.A.
| | - William de Cothi
- Department of Cell and Developmental Biology, University College London, London WC1E 7JE, U.K.
| | - Caswell Barry
- Department of Cell and Developmental Biology, University College London, London WC1E 7JE, U.K.
| | - Samuel J Gershman
- Kempner Institute for the Study of Natural and Artificial Intelligence, and Department of Psychology and Center for Brain Science, Harvard University, Cambridge, MA 02134, U.S.A
- Center for Brains, Minds, and Machines, MIT, Cambridge, MA 02139, U.S.A.
| |
Collapse
|
7
|
Howard JD, Edmonds D, Schoenbaum G, Kahnt T. Distributed midbrain responses signal the content of positive identity prediction errors. Curr Biol 2024; 34:4240-4247.e4. [PMID: 39197457 PMCID: PMC11421979 DOI: 10.1016/j.cub.2024.07.105] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2024] [Revised: 06/12/2024] [Accepted: 07/31/2024] [Indexed: 09/01/2024]
Abstract
Recent work across species has shown that midbrain dopamine neurons signal not only errors in the prediction of reward value but also in the prediction of value-neutral sensory features. To support learning of associative structures in downstream areas, identity prediction errors (iPEs) should signal specific information about the mis-predicted outcome. Here, we used pattern-based analysis of functional magnetic resonance imaging (fMRI) data acquired during reversal learning to characterize the information content of iPE responses in the human midbrain. We find that fMRI responses to value-neutral identity errors contain information about the identity of the unexpectedly received reward (positive iPE+) but not about the identity of the omitted reward (negative iPE-). Exploratory analyses revealed representations of iPE- in the dorsomedial prefrontal cortex. These results demonstrate that ensemble midbrain responses to value-neutral identity errors convey information about the identity of unexpectedly received outcomes, which could shape the formation of novel stimulus-outcome associations that constitute cognitive maps.
Collapse
Affiliation(s)
- James D Howard
- Department of Psychology, Brandeis University, Waltham, MA 02453, USA.
| | - Donnisa Edmonds
- Department of Neurology, Northwestern University, Chicago, IL 60611, USA
| | - Geoffrey Schoenbaum
- Intramural Research Program, National Institute on Drug Abuse, Baltimore, MD 21224, USA
| | - Thorsten Kahnt
- Intramural Research Program, National Institute on Drug Abuse, Baltimore, MD 21224, USA.
| |
Collapse
|
8
|
Feng GW, Rutledge RB. Surprising sounds influence risky decision making. Nat Commun 2024; 15:8027. [PMID: 39271674 PMCID: PMC11399252 DOI: 10.1038/s41467-024-51729-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2023] [Accepted: 08/14/2024] [Indexed: 09/15/2024] Open
Abstract
Adaptive behavior depends on appropriate responses to environmental uncertainty. Incidental sensory events might simply be distracting and increase errors, but alternatively can lead to stereotyped responses despite their irrelevance. To evaluate these possibilities, we test whether task-irrelevant sensory prediction errors influence risky decision making in humans across seven experiments (total n = 1600). Rare auditory sequences preceding option presentation systematically increase risk taking and decrease choice perseveration (i.e., increased tendency to switch away from previously chosen options). The risk-taking and perseveration effects are dissociable by manipulating auditory statistics: when rare sequences end on standard tones, including when rare sequences consist only of standard tones, participants are less likely to perseverate after rare sequences but not more likely to take risks. Computational modeling reveals that these effects cannot be explained by increased decision noise but can be explained by value-independent risky bias and perseveration parameters, decision biases previously linked to dopamine. Control experiments demonstrate that both surprise effects can be eliminated when tone sequences are presented in a balanced or fully predictable manner, and that surprise effects cannot be explained by erroneous beliefs. These findings suggest that incidental sounds may influence many of the decisions we make in daily life.
Collapse
Affiliation(s)
- Gloria W Feng
- Department of Psychology, Yale University, New Haven, CT, USA.
| | - Robb B Rutledge
- Department of Psychology, Yale University, New Haven, CT, USA.
- Wu Tsai Institute, Yale University, New Haven, CT, USA.
- Department of Psychiatry, Yale University, New Haven, CT, USA.
- Wellcome Centre for Human Neuroimaging, UCL, London, UK.
| |
Collapse
|
9
|
Gershman SJ, Assad JA, Datta SR, Linderman SW, Sabatini BL, Uchida N, Wilbrecht L. Explaining dopamine through prediction errors and beyond. Nat Neurosci 2024; 27:1645-1655. [PMID: 39054370 DOI: 10.1038/s41593-024-01705-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2023] [Accepted: 06/13/2024] [Indexed: 07/27/2024]
Abstract
The most influential account of phasic dopamine holds that it reports reward prediction errors (RPEs). The RPE-based interpretation of dopamine signaling is, in its original form, probably too simple and fails to explain all the properties of phasic dopamine observed in behaving animals. This Perspective helps to resolve some of the conflicting interpretations of dopamine that currently exist in the literature. We focus on the following three empirical challenges to the RPE theory of dopamine: why does dopamine (1) ramp up as animals approach rewards, (2) respond to sensory and motor features and (3) influence action selection? We argue that the prediction error concept, once it has been suitably modified and generalized based on an analysis of each computational problem, answers each challenge. Nonetheless, there are a number of additional empirical findings that appear to demand fundamentally different theoretical explanations beyond encoding RPE. Therefore, looking forward, we discuss the prospects for a unifying theory that respects the diversity of dopamine signaling and function as well as the complex circuitry that both underlies and responds to dopaminergic transmission.
Collapse
Affiliation(s)
- Samuel J Gershman
- Department of Psychology and Center for Brain Science, Harvard University, Cambridge, MA, USA.
- Kempner Institute for the Study of Natural and Artificial Intelligence, Harvard University, Cambridge, MA, USA.
| | - John A Assad
- Department of Neurobiology, Harvard Medical School, Boston, MA, USA
| | | | - Scott W Linderman
- Department of Statistics and Wu Tsai Neurosciences Institute, Stanford University, Stanford, CA, USA
| | - Bernardo L Sabatini
- Kempner Institute for the Study of Natural and Artificial Intelligence, Harvard University, Cambridge, MA, USA
- Department of Neurobiology, Harvard Medical School, Boston, MA, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, USA
| | - Naoshige Uchida
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, MA, USA
| | - Linda Wilbrecht
- Department of Psychology and Helen Wills Neuroscience Institute, University of California, Berkeley, CA, USA
| |
Collapse
|
10
|
Taira M, Millard SJ, Verghese A, DiFazio LE, Hoang IB, Jia R, Sias A, Wikenheiser A, Sharpe MJ. Dopamine Release in the Nucleus Accumbens Core Encodes the General Excitatory Components of Learning. J Neurosci 2024; 44:e0120242024. [PMID: 38969504 PMCID: PMC11358529 DOI: 10.1523/jneurosci.0120-24.2024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2024] [Revised: 06/18/2024] [Accepted: 06/20/2024] [Indexed: 07/07/2024] Open
Abstract
Dopamine release in the nucleus accumbens core (NAcC) is generally considered to be a proxy for phasic firing of the ventral tegmental area dopamine (VTADA) neurons. Thus, dopamine release in NAcC is hypothesized to reflect a unitary role in reward prediction error signaling. However, recent studies reveal more diverse roles of dopamine neurons, which support an emerging idea that dopamine regulates learning differently in distinct circuits. To understand whether the NAcC might regulate a unique component of learning, we recorded dopamine release in NAcC while male rats performed a backward conditioning task where a reward is followed by a neutral cue. We used this task because we can delineate different components of learning, which include sensory-specific inhibitory and general excitatory components. Furthermore, we have shown that VTADA neurons are necessary for both the specific and general components of backward associations. Here, we found that dopamine release in NAcC increased to the reward across learning while reducing to the cue that followed as it became more expected. This mirrors the dopamine prediction error signal seen during forward conditioning and cannot be accounted for temporal-difference reinforcement learning. Subsequent tests allowed us to dissociate these learning components and revealed that dopamine release in NAcC reflects the general excitatory component of backward associations, but not their sensory-specific component. These results emphasize the importance of examining distinct functions of different dopamine projections in reinforcement learning.
Collapse
Affiliation(s)
- Masakazu Taira
- Department of Psychology, University of Sydney, Camperdown, New South Wales 2006, Australia
- Department of Psychology, University of California, Los Angeles 90095, California
| | - Samuel J Millard
- Department of Psychology, University of California, Los Angeles 90095, California
| | - Anna Verghese
- Department of Psychology, University of California, Los Angeles 90095, California
| | - Lauren E DiFazio
- Department of Psychology, University of California, Los Angeles 90095, California
| | - Ivy B Hoang
- Department of Psychology, University of California, Los Angeles 90095, California
| | - Ruiting Jia
- Department of Psychology, University of California, Los Angeles 90095, California
| | - Ana Sias
- Department of Psychology, University of California, Los Angeles 90095, California
| | - Andrew Wikenheiser
- Department of Psychology, University of California, Los Angeles 90095, California
| | - Melissa J Sharpe
- Department of Psychology, University of Sydney, Camperdown, New South Wales 2006, Australia
- Department of Psychology, University of California, Los Angeles 90095, California
| |
Collapse
|
11
|
Heng JG, Zhang J, Bonetti L, Lim WPH, Vuust P, Agres K, Chen SHA. Understanding music and aging through the lens of Bayesian inference. Neurosci Biobehav Rev 2024; 163:105768. [PMID: 38908730 DOI: 10.1016/j.neubiorev.2024.105768] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2024] [Revised: 06/05/2024] [Accepted: 06/10/2024] [Indexed: 06/24/2024]
Abstract
Bayesian inference has recently gained momentum in explaining music perception and aging. A fundamental mechanism underlying Bayesian inference is the notion of prediction. This framework could explain how predictions pertaining to musical (melodic, rhythmic, harmonic) structures engender action, emotion, and learning, expanding related concepts of music research, such as musical expectancies, groove, pleasure, and tension. Moreover, a Bayesian perspective of music perception may shed new insights on the beneficial effects of music in aging. Aging could be framed as an optimization process of Bayesian inference. As predictive inferences refine over time, the reliance on consolidated priors increases, while the updating of prior models through Bayesian inference attenuates. This may affect the ability of older adults to estimate uncertainties in their environment, limiting their cognitive and behavioral repertoire. With Bayesian inference as an overarching framework, this review synthesizes the literature on predictive inferences in music and aging, and details how music could be a promising tool in preventive and rehabilitative interventions for older adults through the lens of Bayesian inference.
Collapse
Affiliation(s)
- Jiamin Gladys Heng
- School of Computer Science and Engineering, Nanyang Technological University, Singapore.
| | - Jiayi Zhang
- Interdisciplinary Graduate Program, Nanyang Technological University, Singapore; School of Social Sciences, Nanyang Technological University, Singapore; Centre for Research and Development in Learning, Nanyang Technological University, Singapore
| | - Leonardo Bonetti
- Center for Music in the Brain, Department of Clinical Medicine, Aarhus University & The Royal Academy of Music, Aarhus, Aalborg, Denmark; Centre for Eudaimonia and Human Flourishing, Linacre College, University of Oxford, United Kingdom; Department of Psychiatry, University of Oxford, United Kingdom; Department of Psychology, University of Bologna, Italy
| | | | - Peter Vuust
- Center for Music in the Brain, Department of Clinical Medicine, Aarhus University & The Royal Academy of Music, Aarhus, Aalborg, Denmark
| | - Kat Agres
- Centre for Music and Health, National University of Singapore, Singapore; Yong Siew Toh Conservatory of Music, National University of Singapore, Singapore
| | - Shen-Hsing Annabel Chen
- School of Social Sciences, Nanyang Technological University, Singapore; Centre for Research and Development in Learning, Nanyang Technological University, Singapore; Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore; National Institute of Education, Nanyang Technological University, Singapore.
| |
Collapse
|
12
|
Song MR, Lee SW. Rethinking dopamine-guided action sequence learning. Eur J Neurosci 2024; 60:3447-3465. [PMID: 38798086 DOI: 10.1111/ejn.16426] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2023] [Revised: 04/21/2024] [Accepted: 05/08/2024] [Indexed: 05/29/2024]
Abstract
As opposed to those requiring a single action for reward acquisition, tasks necessitating action sequences demand that animals learn action elements and their sequential order and sustain the behaviour until the sequence is completed. With repeated learning, animals not only exhibit precise execution of these sequences but also demonstrate enhanced smoothness and efficiency. Previous research has demonstrated that midbrain dopamine and its major projection target, the striatum, play crucial roles in these processes. Recent studies have shown that dopamine from the substantia nigra pars compacta (SNc) and the ventral tegmental area (VTA) serve distinct functions in action sequence learning. The distinct contributions of dopamine also depend on the striatal subregions, namely the ventral, dorsomedial and dorsolateral striatum. Here, we have reviewed recent findings on the role of striatal dopamine in action sequence learning, with a focus on recent rodent studies.
Collapse
Affiliation(s)
- Minryung R Song
- Department of Brain and Cognitive Sciences, KAIST, Daejeon, South Korea
| | - Sang Wan Lee
- Department of Brain and Cognitive Sciences, KAIST, Daejeon, South Korea
- Kim Jaechul Graduate School of AI, KAIST, Daejeon, South Korea
- KI for Health Science and Technology, KAIST, Daejeon, South Korea
- Center for Neuroscience-inspired AI, KAIST, Daejeon, South Korea
| |
Collapse
|
13
|
Wüllhorst R, Wüllhorst V, Endrass T. Risk-Taking Is Associated with Decreased Subjective Value Signals and Increased Prediction Error Signals in the Hot Columbia Card Task. J Neurosci 2024; 44:e1337232024. [PMID: 38561225 PMCID: PMC11112641 DOI: 10.1523/jneurosci.1337-23.2024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2023] [Revised: 03/05/2024] [Accepted: 03/08/2024] [Indexed: 04/04/2024] Open
Abstract
It remains a pressing concern to understand how neural computations relate to risky decisions. However, most observations of brain-behavior relationships in the risk-taking domain lack a rigorous computational basis or fail to emulate of the dynamic, sequential nature of real-life risky decision-making. Recent advances emphasize the role of neural prediction error (PE) signals. We modeled, according to prospect theory, the choices of n = 43 human participants (33 females, 10 males) performing an EEG version of the hot Columbia Card Task, featuring rounds of sequential decisions between stopping (safe option) and continuing with increasing odds of a high loss (risky option). Single-trial regression EEG analyses yielded a subjective value signal at centroparietal (300-700 ms) and frontocentral (>800 ms) electrodes and in the delta band, as well as PE signals tied to the feedback-related negativity, P3a, and P3b, and in the theta band. Higher risk preference (total number of risky choices) was linked to attenuated subjective value signals but increased PE signals. Higher P3-like activity associated with the most positive PE in each round predicted stopping in the present round but not risk-taking in the subsequent round. Our findings indicate that decreased representation of decision values and increased sensitivity to winning despite low odds (positive PE) facilitate risky choices at the subject level. Strong neural responses when gains are least expected (the most positive PE on each round) adaptively contribute to safer choices at the trial-by-trial level but do not affect risky choice at the round-by-round level.
Collapse
Affiliation(s)
- Raoul Wüllhorst
- Institute of Clinical Psychology and Psychotherapy, Faculty of Psychology, Technische Universität Dresden, Dresden 01187, Germany
| | - Verena Wüllhorst
- Institute of Clinical Psychology and Psychotherapy, Faculty of Psychology, Technische Universität Dresden, Dresden 01187, Germany
| | - Tanja Endrass
- Institute of Clinical Psychology and Psychotherapy, Faculty of Psychology, Technische Universität Dresden, Dresden 01187, Germany
| |
Collapse
|
14
|
Sias AC, Jafar Y, Goodpaster CM, Ramírez-Armenta K, Wrenn TM, Griffin NK, Patel K, Lamparelli AC, Sharpe MJ, Wassum KM. Dopamine projections to the basolateral amygdala drive the encoding of identity-specific reward memories. Nat Neurosci 2024; 27:728-736. [PMID: 38396258 PMCID: PMC11110430 DOI: 10.1038/s41593-024-01586-7] [Citation(s) in RCA: 11] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2022] [Accepted: 01/24/2024] [Indexed: 02/25/2024]
Abstract
To make adaptive decisions, we build an internal model of the associative relationships in an environment and use it to make predictions and inferences about specific available outcomes. Detailed, identity-specific cue-reward memories are a core feature of such cognitive maps. Here we used fiber photometry, cell-type and pathway-specific optogenetic manipulation, Pavlovian cue-reward conditioning and decision-making tests in male and female rats, to reveal that ventral tegmental area dopamine (VTADA) projections to the basolateral amygdala (BLA) drive the encoding of identity-specific cue-reward memories. Dopamine is released in the BLA during cue-reward pairing; VTADA→BLA activity is necessary and sufficient to link the identifying features of a reward to a predictive cue but does not assign general incentive properties to the cue or mediate reinforcement. These data reveal a dopaminergic pathway for the learning that supports adaptive decision-making and help explain how VTADA neurons achieve their emerging multifaceted role in learning.
Collapse
Affiliation(s)
- Ana C Sias
- Department of Psychology, University of California, Los Angeles, Los Angeles, CA, USA
| | - Yousif Jafar
- Department of Psychology, University of California, Los Angeles, Los Angeles, CA, USA
| | - Caitlin M Goodpaster
- Department of Psychology, University of California, Los Angeles, Los Angeles, CA, USA
| | | | - Tyler M Wrenn
- Department of Psychology, University of California, Los Angeles, Los Angeles, CA, USA
| | - Nicholas K Griffin
- Department of Psychology, University of California, Los Angeles, Los Angeles, CA, USA
| | - Keshav Patel
- Department of Psychology, University of California, Los Angeles, Los Angeles, CA, USA
| | | | - Melissa J Sharpe
- Department of Psychology, University of California, Los Angeles, Los Angeles, CA, USA
- Brain Research Institute, University of California, Los Angeles, Los Angeles, CA, USA
- Integrative Center for Learning and Memory, University of California, Los Angeles, Los Angeles, CA, USA
- Integrative Center for Addictive Disorders, University of California, Los Angeles, Los Angeles, CA, USA
- Department of Psychology, University of Sydney, Sydney, New South Wales, Australia
| | - Kate M Wassum
- Department of Psychology, University of California, Los Angeles, Los Angeles, CA, USA.
- Brain Research Institute, University of California, Los Angeles, Los Angeles, CA, USA.
- Integrative Center for Learning and Memory, University of California, Los Angeles, Los Angeles, CA, USA.
- Integrative Center for Addictive Disorders, University of California, Los Angeles, Los Angeles, CA, USA.
| |
Collapse
|
15
|
Liu Q, Zhao Y, Attanti S, Voss JL, Schoenbaum G, Kahnt T. Midbrain signaling of identity prediction errors depends on orbitofrontal cortex networks. Nat Commun 2024; 15:1704. [PMID: 38402210 PMCID: PMC10894191 DOI: 10.1038/s41467-024-45880-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2023] [Accepted: 02/06/2024] [Indexed: 02/26/2024] Open
Abstract
Outcome-guided behavior requires knowledge about the identity of future rewards. Previous work across species has shown that the dopaminergic midbrain responds to violations in expected reward identity and that the lateral orbitofrontal cortex (OFC) represents reward identity expectations. Here we used network-targeted transcranial magnetic stimulation (TMS) and functional magnetic resonance imaging (fMRI) during a trans-reinforcer reversal learning task to test the hypothesis that outcome expectations in the lateral OFC contribute to the computation of identity prediction errors (iPE) in the midbrain. Network-targeted TMS aiming at lateral OFC reduced the global connectedness of the lateral OFC and impaired reward identity learning in the first block of trials. Critically, TMS disrupted neural representations of expected reward identity in the OFC and modulated iPE responses in the midbrain. These results support the idea that iPE signals in the dopaminergic midbrain are computed based on outcome expectations represented in the lateral OFC.
Collapse
Affiliation(s)
- Qingfang Liu
- National Institute on Drug Abuse Intramural Research Program, Baltimore, MD, 21224, USA
| | - Yao Zhao
- National Institute on Drug Abuse Intramural Research Program, Baltimore, MD, 21224, USA
| | - Sumedha Attanti
- Mayo Clinic Alix School of Medicine, Scottsdale, AZ, 85259, USA
| | - Joel L Voss
- Department of Neurology, The University of Chicago, Chicago, IL, 60611, USA
| | - Geoffrey Schoenbaum
- National Institute on Drug Abuse Intramural Research Program, Baltimore, MD, 21224, USA
| | - Thorsten Kahnt
- National Institute on Drug Abuse Intramural Research Program, Baltimore, MD, 21224, USA.
| |
Collapse
|
16
|
Fiorilli J, Marchesi P, Ruikes T, Huis in ‘t Veld G, Buckton R, Quintero MD, Reiten I, Bjaalie JG, Pennartz CMA. Neural correlates of object identity and reward outcome in the sensory cortical-hippocampal hierarchy: coding of motivational information in perirhinal cortex. Cereb Cortex 2024; 34:bhae002. [PMID: 38314581 PMCID: PMC10847907 DOI: 10.1093/cercor/bhae002] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2023] [Revised: 12/21/2023] [Accepted: 12/24/2023] [Indexed: 02/06/2024] Open
Abstract
Neural circuits support behavioral adaptations by integrating sensory and motor information with reward and error-driven learning signals, but it remains poorly understood how these signals are distributed across different levels of the corticohippocampal hierarchy. We trained rats on a multisensory object-recognition task and compared visual and tactile responses of simultaneously recorded neuronal ensembles in somatosensory cortex, secondary visual cortex, perirhinal cortex, and hippocampus. The sensory regions primarily represented unisensory information, whereas hippocampus was modulated by both vision and touch. Surprisingly, the sensory cortices and the hippocampus coded object-specific information, whereas the perirhinal cortex did not. Instead, perirhinal cortical neurons signaled trial outcome upon reward-based feedback. A majority of outcome-related perirhinal cells responded to a negative outcome (reward omission), whereas a minority of other cells coded positive outcome (reward delivery). Our results highlight a distributed neural coding of multisensory variables in the cortico-hippocampal hierarchy. Notably, the perirhinal cortex emerges as a crucial region for conveying motivational outcomes, whereas distinct functions related to object identity are observed in the sensory cortices and hippocampus.
Collapse
Affiliation(s)
- Julien Fiorilli
- Systems and Cognitive Neuroscience Group, SILS Center for Neuroscience, University of Amsterdam, 1098 XH Amsterdam, The Netherlands
| | - Pietro Marchesi
- Systems and Cognitive Neuroscience Group, SILS Center for Neuroscience, University of Amsterdam, 1098 XH Amsterdam, The Netherlands
| | - Thijs Ruikes
- Systems and Cognitive Neuroscience Group, SILS Center for Neuroscience, University of Amsterdam, 1098 XH Amsterdam, The Netherlands
| | - Gerjan Huis in ‘t Veld
- Systems and Cognitive Neuroscience Group, SILS Center for Neuroscience, University of Amsterdam, 1098 XH Amsterdam, The Netherlands
| | - Rhys Buckton
- Systems and Cognitive Neuroscience Group, SILS Center for Neuroscience, University of Amsterdam, 1098 XH Amsterdam, The Netherlands
| | - Mariana D Quintero
- Systems and Cognitive Neuroscience Group, SILS Center for Neuroscience, University of Amsterdam, 1098 XH Amsterdam, The Netherlands
| | - Ingrid Reiten
- Institute of Basic Medical Sciences, University of Oslo, NO-0316 Oslo, Norway
| | - Jan G Bjaalie
- Institute of Basic Medical Sciences, University of Oslo, NO-0316 Oslo, Norway
| | - Cyriel M A Pennartz
- Systems and Cognitive Neuroscience Group, SILS Center for Neuroscience, University of Amsterdam, 1098 XH Amsterdam, The Netherlands
| |
Collapse
|
17
|
Takahashi YK, Stalnaker TA, Mueller LE, Harootonian SK, Langdon AJ, Schoenbaum G. Dopaminergic prediction errors in the ventral tegmental area reflect a multithreaded predictive model. Nat Neurosci 2023; 26:830-839. [PMID: 37081296 PMCID: PMC10646487 DOI: 10.1038/s41593-023-01310-x] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2021] [Accepted: 03/16/2023] [Indexed: 04/22/2023]
Abstract
Dopamine neuron activity is tied to the prediction error in temporal difference reinforcement learning models. These models make significant simplifying assumptions, particularly with regard to the structure of the predictions fed into the dopamine neurons, which consist of a single chain of timepoint states. Although this predictive structure can explain error signals observed in many studies, it cannot cope with settings where subjects might infer multiple independent events and outcomes. In the present study, we recorded dopamine neurons in the ventral tegmental area in such a setting to test the validity of the single-stream assumption. Rats were trained in an odor-based choice task, in which the timing and identity of one of several rewards delivered in each trial changed across trial blocks. This design revealed an error signaling pattern that requires the dopamine neurons to access and update multiple independent predictive streams reflecting the subject's belief about timing and potentially unique identities of expected rewards.
Collapse
Affiliation(s)
- Yuji K Takahashi
- Intramural Research Program, National Institute on Drug Abuse, Baltimore, MD, USA.
| | - Thomas A Stalnaker
- Intramural Research Program, National Institute on Drug Abuse, Baltimore, MD, USA
| | - Lauren E Mueller
- Intramural Research Program, National Institute on Drug Abuse, Baltimore, MD, USA
| | | | - Angela J Langdon
- Intramural Research Program, National Institute of Mental Health, Bethesda, MD, USA.
| | - Geoffrey Schoenbaum
- Intramural Research Program, National Institute on Drug Abuse, Baltimore, MD, USA.
| |
Collapse
|
18
|
Sippy T, Tritsch NX. Unraveling the dynamics of dopamine release and its actions on target cells. Trends Neurosci 2023; 46:228-239. [PMID: 36635111 PMCID: PMC10204099 DOI: 10.1016/j.tins.2022.12.005] [Citation(s) in RCA: 27] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2022] [Revised: 11/22/2022] [Accepted: 12/15/2022] [Indexed: 01/11/2023]
Abstract
The neuromodulator dopamine (DA) is essential for regulating learning, motivation, and movement. Despite its importance, however, the mechanisms by which DA influences the activity of target cells to alter behavior remain poorly understood. In this review, we describe recent methodological advances that are helping to overcome challenges that have historically hindered the field. We discuss how the employment of these methods is shedding light on the complex dynamics of extracellular DA in the brain, as well as how DA signaling alters the electrical, biochemical, and population activity of target neurons in vivo. These developments are generating novel hypotheses about the mechanisms through which DA release modifies behavior.
Collapse
Affiliation(s)
- Tanya Sippy
- Neuroscience Institute, New York University Grossman School of Medicine, New York, NY, USA; Department of Psychiatry, New York University Grossman School of Medicine, New York, NY, USA.
| | - Nicolas X Tritsch
- Neuroscience Institute, New York University Grossman School of Medicine, New York, NY, USA; Fresco Institute for Parkinson's and Movement Disorders, New York University Langone Health, New York, NY, USA.
| |
Collapse
|
19
|
Marquis M, Wilson RI. Locomotor and olfactory responses in dopamine neurons of the Drosophila superior-lateral brain. Curr Biol 2022; 32:5406-5414.e5. [PMID: 36450284 DOI: 10.1016/j.cub.2022.11.008] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2022] [Revised: 08/17/2022] [Accepted: 11/03/2022] [Indexed: 12/03/2022]
Abstract
The Drosophila brain contains about 50 distinct morphological types of dopamine neurons.1,2,3,4 Physiological studies of Drosophila dopamine neurons have been largely limited to one brain region, the mushroom body,5,6,7,8,9,10,11,12,13 where they are implicated in learning.14,15,16,17,18 By comparison, we know little about the physiology of other Drosophila dopamine neurons. Interestingly, a recent whole-brain imaging study found that dopamine neuron activity in several fly brain regions is correlated with locomotion.19 This is notable because many dopamine neurons in the rodent brain are also correlated with locomotion or other movements20,21,22,23,24,25,26,27,28,29,30; however, most rodent studies have focused on learned and rewarded behaviors, and few have investigated dopamine neuron activity during spontaneous (self-timed) movements. In this study, we monitored dopamine neurons in the Drosophila brain during self-timed locomotor movements, focusing on several previously uncharacterized cell types that arborize in the superior-lateral brain, specifically the lateral horn and superior-lateral protocerebrum. We found that activity of all of these dopamine neurons correlated with spontaneous fluctuations in walking speed, with different cell types showing different speed correlations. Some dopamine neurons also responded to odors, but these responses were suppressed by repeated odor encounters. Finally, we found that the same identifiable dopamine neuron can encode different combinations of locomotion and odor in different individuals. If these dopamine neurons promote synaptic plasticity-like the dopamine neurons of the mushroom body-then, their tuning profiles would imply that plasticity depends on a flexible integration of sensory signals, motor signals, and recent experience.
Collapse
Affiliation(s)
- Michael Marquis
- Department of Neurobiology, Harvard Medical School, 220 Longwood Avenue, Boston, MA 02115, USA
| | - Rachel I Wilson
- Department of Neurobiology, Harvard Medical School, 220 Longwood Avenue, Boston, MA 02115, USA.
| |
Collapse
|
20
|
Mikus N, Korb S, Massaccesi C, Gausterer C, Graf I, Willeit M, Eisenegger C, Lamm C, Silani G, Mathys C. Effects of dopamine D2/3 and opioid receptor antagonism on the trade-off between model-based and model-free behaviour in healthy volunteers. eLife 2022; 11:e79661. [PMID: 36468832 PMCID: PMC9721617 DOI: 10.7554/elife.79661] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2022] [Accepted: 11/22/2022] [Indexed: 12/11/2022] Open
Abstract
Human behaviour requires flexible arbitration between actions we do out of habit and actions that are directed towards a specific goal. Drugs that target opioid and dopamine receptors are notorious for inducing maladaptive habitual drug consumption; yet, how the opioidergic and dopaminergic neurotransmitter systems contribute to the arbitration between habitual and goal-directed behaviour is poorly understood. By combining pharmacological challenges with a well-established decision-making task and a novel computational model, we show that the administration of the dopamine D2/3 receptor antagonist amisulpride led to an increase in goal-directed or 'model-based' relative to habitual or 'model-free' behaviour, whereas the non-selective opioid receptor antagonist naltrexone had no appreciable effect. The effect of amisulpride on model-based/model-free behaviour did not scale with drug serum levels in the blood. Furthermore, participants with higher amisulpride serum levels showed higher explorative behaviour. These findings highlight the distinct functional contributions of dopamine and opioid receptors to goal-directed and habitual behaviour and support the notion that even small doses of amisulpride promote flexible application of cognitive control.
Collapse
Affiliation(s)
- Nace Mikus
- Department of Cognition, Emotion, and Methods in Psychology, Faculty of Psychology, University of ViennaViennaAustria
- Interacting Minds Centre, Aarhus UniversityAarhusDenmark
| | - Sebastian Korb
- Department of Cognition, Emotion, and Methods in Psychology, Faculty of Psychology, University of ViennaViennaAustria
- Department of Psychology, University of EssexColchesterUnited Kingdom
| | - Claudia Massaccesi
- Department of Clinical and Health Psychology, Faculty of Psychology, University of ViennaViennaAustria
| | - Christian Gausterer
- FDZ‐Forensisches DNA Zentrallabor GmbH, Medical University of ViennaViennaAustria
| | - Irene Graf
- Department of Psychiatry and Psychotherapy, Medical University of ViennaViennaAustria
| | - Matthäus Willeit
- Department of Psychiatry and Psychotherapy, Medical University of ViennaViennaAustria
| | - Christoph Eisenegger
- Department of Cognition, Emotion, and Methods in Psychology, Faculty of Psychology, University of ViennaViennaAustria
| | - Claus Lamm
- Department of Cognition, Emotion, and Methods in Psychology, Faculty of Psychology, University of ViennaViennaAustria
| | - Giorgia Silani
- Department of Clinical and Health Psychology, Faculty of Psychology, University of ViennaViennaAustria
| | - Christoph Mathys
- Interacting Minds Centre, Aarhus UniversityAarhusDenmark
- Translational Neuromodeling Unit (TNU), Institute for Biomedical Engineering, University of Zurich and ETH ZurichZurichSwitzerland
- Scuola Internazionale Superiore di Studi Avanzati (SISSA)TriesteItaly
| |
Collapse
|
21
|
Kutlu MG, Zachry JE, Melugin PR, Tat J, Cajigas S, Isiktas AU, Patel DD, Siciliano CA, Schoenbaum G, Sharpe MJ, Calipari ES. Dopamine signaling in the nucleus accumbens core mediates latent inhibition. Nat Neurosci 2022; 25:1071-1081. [PMID: 35902648 PMCID: PMC9768922 DOI: 10.1038/s41593-022-01126-1] [Citation(s) in RCA: 36] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2021] [Accepted: 06/21/2022] [Indexed: 11/09/2022]
Abstract
Studies investigating the neural mechanisms by which associations between cues and predicted outcomes control behavior often use associative learning frameworks to understand the neural control of behavior. These frameworks do not always account for the full range of effects that novelty can have on behavior and future associative learning. Here, in mice, we show that dopamine in the nucleus accumbens core is evoked by novel, neutral stimuli, and the trajectory of this response over time tracked habituation to these stimuli. Habituation to novel cues before associative learning reduced future associative learning, a phenomenon known as latent inhibition. Crucially, trial-by-trial dopamine response patterns tracked this phenomenon. Optogenetic manipulation of dopamine responses to the cue during the habituation period bidirectionally influenced future associative learning. Thus, dopamine signaling in the nucleus accumbens core has a causal role in novelty-based learning in a way that cannot be predicted based on purely associative factors.
Collapse
Affiliation(s)
- Munir Gunes Kutlu
- Department of Pharmacology, Vanderbilt University, Nashville, TN, USA
| | - Jennifer E Zachry
- Department of Pharmacology, Vanderbilt University, Nashville, TN, USA
| | - Patrick R Melugin
- Vanderbilt Brain Institute, Vanderbilt University, Nashville, TN, USA
| | - Jennifer Tat
- Department of Pharmacology, Vanderbilt University, Nashville, TN, USA
| | - Stephanie Cajigas
- Department of Pharmacology, Vanderbilt University, Nashville, TN, USA
| | - Atagun U Isiktas
- Department of Pharmacology, Vanderbilt University, Nashville, TN, USA
| | - Dev D Patel
- Department of Pharmacology, Vanderbilt University, Nashville, TN, USA
| | - Cody A Siciliano
- Department of Pharmacology, Vanderbilt University, Nashville, TN, USA
- Vanderbilt Brain Institute, Vanderbilt University, Nashville, TN, USA
- Vanderbilt Center for Addiction Research, Vanderbilt University, Nashville, TN, USA
| | - Geoffrey Schoenbaum
- Intramural Research Program, National Institutes on Drug Abuse, Baltimore, MD, USA
| | - Melissa J Sharpe
- Department of Psychology, University of California, Los Angeles, Los Angeles, CA, USA
| | - Erin S Calipari
- Department of Pharmacology, Vanderbilt University, Nashville, TN, USA.
- Vanderbilt Brain Institute, Vanderbilt University, Nashville, TN, USA.
- Vanderbilt Center for Addiction Research, Vanderbilt University, Nashville, TN, USA.
- Department of Molecular Physiology and Biophysics, Vanderbilt University, Nashville, TN, USA.
- Department of Psychiatry and Behavioral Sciences, Vanderbilt University, Nashville, TN, USA.
| |
Collapse
|
22
|
de Jong JW, Fraser KM, Lammel S. Mesoaccumbal Dopamine Heterogeneity: What Do Dopamine Firing and Release Have to Do with It? Annu Rev Neurosci 2022; 45:109-129. [PMID: 35226827 PMCID: PMC9271543 DOI: 10.1146/annurev-neuro-110920-011929] [Citation(s) in RCA: 44] [Impact Index Per Article: 14.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Ventral tegmental area (VTA) dopamine (DA) neurons are often thought to uniformly encode reward prediction errors. Conversely, DA release in the nucleus accumbens (NAc), the prominent projection target of these neurons, has been implicated in reinforcement learning, motivation, aversion, and incentive salience. This contrast between heterogeneous functions of DA release versus a homogeneous role for DA neuron activity raises numerous questions regarding how VTA DA activity translates into NAc DA release. Further complicating this issue is increasing evidence that distinct VTA DA projections into defined NAc subregions mediate diverse behavioral functions. Here, we evaluate evidence for heterogeneity within the mesoaccumbal DA system and argue that frameworks of DA function must incorporate the precise topographic organization of VTA DA neurons to clarify their contribution to health and disease.
Collapse
Affiliation(s)
- Johannes W de Jong
- Department of Molecular and Cell Biology and Helen Wills Neuroscience Institute, University of California, Berkeley, California, USA;
| | - Kurt M Fraser
- Department of Molecular and Cell Biology and Helen Wills Neuroscience Institute, University of California, Berkeley, California, USA;
| | - Stephan Lammel
- Department of Molecular and Cell Biology and Helen Wills Neuroscience Institute, University of California, Berkeley, California, USA;
| |
Collapse
|
23
|
Seitz BM, Hoang IB, DiFazio LE, Blaisdell AP, Sharpe MJ. Dopamine errors drive excitatory and inhibitory components of backward conditioning in an outcome-specific manner. Curr Biol 2022; 32:3210-3218.e3. [PMID: 35752165 DOI: 10.1016/j.cub.2022.06.035] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2022] [Revised: 04/29/2022] [Accepted: 06/13/2022] [Indexed: 01/06/2023]
Abstract
For over two decades, phasic activity in midbrain dopamine neurons was considered synonymous with the prediction error in temporal-difference reinforcement learning.1-4 Central to this proposal is the notion that reward-predictive stimuli become endowed with the scalar value of predicted rewards. When these cues are subsequently encountered, their predictive value is compared to the value of the actual reward received, allowing for the calculation of prediction errors.5,6 Phasic firing of dopamine neurons was proposed to reflect this computation,1,2 facilitating the backpropagation of value from the predicted reward to the reward-predictive stimulus, thus reducing future prediction errors. There are two critical assumptions of this proposal: (1) that dopamine errors can only facilitate learning about scalar value and not more complex features of predicted rewards, and (2) that the dopamine signal can only be involved in anticipatory cue-reward learning in which cues or actions precede rewards. Recent work7-15 has challenged the first assumption, demonstrating that phasic dopamine signals across species are involved in learning about more complex features of the predicted outcomes, in a manner that transcends this value computation. Here, we tested the validity of the second assumption. Specifically, we examined whether phasic midbrain dopamine activity would be necessary for backward conditioning-when a neutral cue reliably follows a rewarding outcome.16-20 Using a specific Pavlovian-to-instrumental transfer (PIT) procedure,21-23 we show rats learn both excitatory and inhibitory components of a backward association, and that this association entails knowledge of the specific identity of the reward and cue. We demonstrate that brief optogenetic inhibition of VTADA neurons timed to the transition between the reward and cue reduces both of these components of backward conditioning. These findings suggest VTADA neurons are capable of facilitating associations between contiguously occurring events, regardless of the content of those events. We conclude that these data may be in line with suggestions that the VTADA error acts as a universal teaching signal. This may provide insight into why dopamine function has been implicated in myriad psychological disorders that are characterized by very distinct reinforcement-learning deficits.
Collapse
Affiliation(s)
- Benjamin M Seitz
- Department of Psychology, University of California, Los Angeles, Portola Plaza, Los Angeles, CA 91602, USA
| | - Ivy B Hoang
- Department of Psychology, University of California, Los Angeles, Portola Plaza, Los Angeles, CA 91602, USA
| | - Lauren E DiFazio
- Department of Psychology, University of California, Los Angeles, Portola Plaza, Los Angeles, CA 91602, USA
| | - Aaron P Blaisdell
- Department of Psychology, University of California, Los Angeles, Portola Plaza, Los Angeles, CA 91602, USA
| | - Melissa J Sharpe
- Department of Psychology, University of California, Los Angeles, Portola Plaza, Los Angeles, CA 91602, USA.
| |
Collapse
|
24
|
Grujic N, Brus J, Burdakov D, Polania R. Rational inattention in mice. SCIENCE ADVANCES 2022; 8:eabj8935. [PMID: 35245128 PMCID: PMC8896787 DOI: 10.1126/sciadv.abj8935] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/08/2023]
Abstract
Behavior exhibited by humans and other organisms is generally inconsistent and biased and, thus, is often labeled irrational. However, the origins of this seemingly suboptimal behavior remain elusive. We developed a behavioral task and normative framework to reveal how organisms should allocate their limited processing resources such that sensory precision and its related metabolic investment are balanced to guarantee maximal utility. We found that mice act as rational inattentive agents by adaptively allocating their sensory resources in a way that maximizes reward consumption in previously unexperienced stimulus-reward association environments. Unexpectedly, perception of commonly occurring stimuli was relatively imprecise; however, this apparent statistical fallacy implies "awareness" and efficient adaptation to their neurocognitive limitations. Arousal systems carry reward distribution information of sensory signals, and distributional reinforcement learning mechanisms regulate sensory precision via top-down normalization. These findings reveal how organisms efficiently perceive and adapt to previously unexperienced environmental contexts within the constraints imposed by neurobiology.
Collapse
Affiliation(s)
- Nikola Grujic
- Institute for Neuroscience, Department of Health Sciences and Technology, ETH Zurich, Zurich, Switzerland
- Neuroscience Center Zürich, Zurich, Switzerland
| | - Jeroen Brus
- Neuroscience Center Zürich, Zurich, Switzerland
- Decision Neuroscience Lab, Department of Health Sciences and Technology, ETH Zurich, Zurich, Switzerland
| | - Denis Burdakov
- Institute for Neuroscience, Department of Health Sciences and Technology, ETH Zurich, Zurich, Switzerland
- Neuroscience Center Zürich, Zurich, Switzerland
- Corresponding author. (R.P.); (D.B.)
| | - Rafael Polania
- Neuroscience Center Zürich, Zurich, Switzerland
- Decision Neuroscience Lab, Department of Health Sciences and Technology, ETH Zurich, Zurich, Switzerland
- Corresponding author. (R.P.); (D.B.)
| |
Collapse
|
25
|
Whittington JC, Behrens TE. Reinforcement learning: Dopamine ramps with fuzzy value estimates. Curr Biol 2022; 32:R213-R215. [DOI: 10.1016/j.cub.2022.01.070] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
26
|
Millard SJ, Bearden CE, Karlsgodt KH, Sharpe MJ. The prediction-error hypothesis of schizophrenia: new data point to circuit-specific changes in dopamine activity. Neuropsychopharmacology 2022; 47:628-640. [PMID: 34588607 PMCID: PMC8782867 DOI: 10.1038/s41386-021-01188-y] [Citation(s) in RCA: 37] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/04/2021] [Revised: 08/23/2021] [Accepted: 09/07/2021] [Indexed: 02/07/2023]
Abstract
Schizophrenia is a severe psychiatric disorder affecting 21 million people worldwide. People with schizophrenia suffer from symptoms including psychosis and delusions, apathy, anhedonia, and cognitive deficits. Strikingly, schizophrenia is characterised by a learning paradox involving difficulties learning from rewarding events, whilst simultaneously 'overlearning' about irrelevant or neutral information. While dysfunction in dopaminergic signalling has long been linked to the pathophysiology of schizophrenia, a cohesive framework that accounts for this learning paradox remains elusive. Recently, there has been an explosion of new research investigating how dopamine contributes to reinforcement learning, which illustrates that midbrain dopamine contributes in complex ways to reinforcement learning, not previously envisioned. This new data brings new possibilities for how dopamine signalling contributes to the symptomatology of schizophrenia. Building on recent work, we present a new neural framework for how we might envision specific dopamine circuits contributing to this learning paradox in schizophrenia in the context of models of reinforcement learning. Further, we discuss avenues of preclinical research with the use of cutting-edge neuroscience techniques where aspects of this model may be tested. Ultimately, it is hoped that this review will spur to action more research utilising specific reinforcement learning paradigms in preclinical models of schizophrenia, to reconcile seemingly disparate symptomatology and develop more efficient therapeutics.
Collapse
Affiliation(s)
- Samuel J Millard
- Department of Psychology, University of California, Los Angeles, CA, 90095, USA.
| | - Carrie E Bearden
- Department of Psychology, University of California, Los Angeles, CA, 90095, USA
- Department of Psychiatry and Biobehavioral Sciences, University of California, Los Angeles, CA, 90095, USA
| | - Katherine H Karlsgodt
- Department of Psychology, University of California, Los Angeles, CA, 90095, USA
- Department of Psychiatry and Biobehavioral Sciences, University of California, Los Angeles, CA, 90095, USA
| | - Melissa J Sharpe
- Department of Psychology, University of California, Los Angeles, CA, 90095, USA.
| |
Collapse
|
27
|
Liakoni V, Lehmann MP, Modirshanechi A, Brea J, Lutti A, Gerstner W, Preuschoff K. Brain signals of a Surprise-Actor-Critic model: Evidence for multiple learning modules in human decision making. Neuroimage 2021; 246:118780. [PMID: 34875383 DOI: 10.1016/j.neuroimage.2021.118780] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2021] [Revised: 08/03/2021] [Accepted: 12/04/2021] [Indexed: 11/25/2022] Open
Abstract
Learning how to reach a reward over long series of actions is a remarkable capability of humans, and potentially guided by multiple parallel learning modules. Current brain imaging of learning modules is limited by (i) simple experimental paradigms, (ii) entanglement of brain signals of different learning modules, and (iii) a limited number of computational models considered as candidates for explaining behavior. Here, we address these three limitations and (i) introduce a complex sequential decision making task with surprising events that allows us to (ii) dissociate correlates of reward prediction errors from those of surprise in functional magnetic resonance imaging (fMRI); and (iii) we test behavior against a large repertoire of model-free, model-based, and hybrid reinforcement learning algorithms, including a novel surprise-modulated actor-critic algorithm. Surprise, derived from an approximate Bayesian approach for learning the world-model, is extracted in our algorithm from a state prediction error. Surprise is then used to modulate the learning rate of a model-free actor, which itself learns via the reward prediction error from model-free value estimation by the critic. We find that action choices are well explained by pure model-free policy gradient, but reaction times and neural data are not. We identify signatures of both model-free and surprise-based learning signals in blood oxygen level dependent (BOLD) responses, supporting the existence of multiple parallel learning modules in the brain. Our results extend previous fMRI findings to a multi-step setting and emphasize the role of policy gradient and surprise signalling in human learning.
Collapse
Affiliation(s)
- Vasiliki Liakoni
- École Polytechnique Fédérale de Lausanne (EPFL), School of Computer and Communication Sciences and School of Life Sciences, Lausanne, Switzerland.
| | - Marco P Lehmann
- École Polytechnique Fédérale de Lausanne (EPFL), School of Computer and Communication Sciences and School of Life Sciences, Lausanne, Switzerland
| | - Alireza Modirshanechi
- École Polytechnique Fédérale de Lausanne (EPFL), School of Computer and Communication Sciences and School of Life Sciences, Lausanne, Switzerland
| | - Johanni Brea
- École Polytechnique Fédérale de Lausanne (EPFL), School of Computer and Communication Sciences and School of Life Sciences, Lausanne, Switzerland
| | - Antoine Lutti
- Laboratoire de recherche en neuroimagerie (LREN), Department of Clinical Neurosciences, Lausanne University Hospital and University of Lausanne, Lausanne, Switzerland
| | - Wulfram Gerstner
- École Polytechnique Fédérale de Lausanne (EPFL), School of Computer and Communication Sciences and School of Life Sciences, Lausanne, Switzerland
| | - Kerstin Preuschoff
- Geneva Finance Research Institute & Interfaculty Center for Affective Sciences, University of Geneva, Geneva, Switzerland
| |
Collapse
|
28
|
Frässle S, Aponte EA, Bollmann S, Brodersen KH, Do CT, Harrison OK, Harrison SJ, Heinzle J, Iglesias S, Kasper L, Lomakina EI, Mathys C, Müller-Schrader M, Pereira I, Petzschner FH, Raman S, Schöbi D, Toussaint B, Weber LA, Yao Y, Stephan KE. TAPAS: An Open-Source Software Package for Translational Neuromodeling and Computational Psychiatry. Front Psychiatry 2021; 12:680811. [PMID: 34149484 PMCID: PMC8206497 DOI: 10.3389/fpsyt.2021.680811] [Citation(s) in RCA: 70] [Impact Index Per Article: 17.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/15/2021] [Accepted: 05/10/2021] [Indexed: 12/26/2022] Open
Abstract
Psychiatry faces fundamental challenges with regard to mechanistically guided differential diagnosis, as well as prediction of clinical trajectories and treatment response of individual patients. This has motivated the genesis of two closely intertwined fields: (i) Translational Neuromodeling (TN), which develops "computational assays" for inferring patient-specific disease processes from neuroimaging, electrophysiological, and behavioral data; and (ii) Computational Psychiatry (CP), with the goal of incorporating computational assays into clinical decision making in everyday practice. In order to serve as objective and reliable tools for clinical routine, computational assays require end-to-end pipelines from raw data (input) to clinically useful information (output). While these are yet to be established in clinical practice, individual components of this general end-to-end pipeline are being developed and made openly available for community use. In this paper, we present the Translational Algorithms for Psychiatry-Advancing Science (TAPAS) software package, an open-source collection of building blocks for computational assays in psychiatry. Collectively, the tools in TAPAS presently cover several important aspects of the desired end-to-end pipeline, including: (i) tailored experimental designs and optimization of measurement strategy prior to data acquisition, (ii) quality control during data acquisition, and (iii) artifact correction, statistical inference, and clinical application after data acquisition. Here, we review the different tools within TAPAS and illustrate how these may help provide a deeper understanding of neural and cognitive mechanisms of disease, with the ultimate goal of establishing automatized pipelines for predictions about individual patients. We hope that the openly available tools in TAPAS will contribute to the further development of TN/CP and facilitate the translation of advances in computational neuroscience into clinically relevant computational assays.
Collapse
Affiliation(s)
- Stefan Frässle
- Translational Neuromodeling Unit (TNU), Institute for Biomedical Engineering, University of Zurich and ETH Zurich, Zurich, Switzerland
| | - Eduardo A. Aponte
- Translational Neuromodeling Unit (TNU), Institute for Biomedical Engineering, University of Zurich and ETH Zurich, Zurich, Switzerland
| | - Saskia Bollmann
- Translational Neuromodeling Unit (TNU), Institute for Biomedical Engineering, University of Zurich and ETH Zurich, Zurich, Switzerland
- Institute for Biomedical Engineering, ETH Zurich and University of Zurich, Zurich, Switzerland
- Centre for Advanced Imaging, The University of Queensland, Brisbane, QLD, Australia
- Athinoula A. Martinos Center for Biomedical Imaging, Massachusetts General Hospital, Charlestown, MA, United States
- Department of Radiology, Harvard Medical School, Charlestown, MA, United States
| | - Kay H. Brodersen
- Translational Neuromodeling Unit (TNU), Institute for Biomedical Engineering, University of Zurich and ETH Zurich, Zurich, Switzerland
- Department of Computer Science, ETH Zurich, Zurich, Switzerland
| | - Cao T. Do
- Translational Neuromodeling Unit (TNU), Institute for Biomedical Engineering, University of Zurich and ETH Zurich, Zurich, Switzerland
| | - Olivia K. Harrison
- Translational Neuromodeling Unit (TNU), Institute for Biomedical Engineering, University of Zurich and ETH Zurich, Zurich, Switzerland
- Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, United Kingdom
- School of Pharmacy, University of Otago, Dunedin, New Zealand
| | - Samuel J. Harrison
- Translational Neuromodeling Unit (TNU), Institute for Biomedical Engineering, University of Zurich and ETH Zurich, Zurich, Switzerland
| | - Jakob Heinzle
- Translational Neuromodeling Unit (TNU), Institute for Biomedical Engineering, University of Zurich and ETH Zurich, Zurich, Switzerland
| | - Sandra Iglesias
- Translational Neuromodeling Unit (TNU), Institute for Biomedical Engineering, University of Zurich and ETH Zurich, Zurich, Switzerland
| | - Lars Kasper
- Translational Neuromodeling Unit (TNU), Institute for Biomedical Engineering, University of Zurich and ETH Zurich, Zurich, Switzerland
- Techna Institute, University Health Network, Toronto, ON, Canada
| | - Ekaterina I. Lomakina
- Translational Neuromodeling Unit (TNU), Institute for Biomedical Engineering, University of Zurich and ETH Zurich, Zurich, Switzerland
- Department of Computer Science, ETH Zurich, Zurich, Switzerland
| | - Christoph Mathys
- Translational Neuromodeling Unit (TNU), Institute for Biomedical Engineering, University of Zurich and ETH Zurich, Zurich, Switzerland
- Interacting Minds Center, Aarhus University, Aarhus, Denmark
| | - Matthias Müller-Schrader
- Translational Neuromodeling Unit (TNU), Institute for Biomedical Engineering, University of Zurich and ETH Zurich, Zurich, Switzerland
| | - Inês Pereira
- Translational Neuromodeling Unit (TNU), Institute for Biomedical Engineering, University of Zurich and ETH Zurich, Zurich, Switzerland
| | - Frederike H. Petzschner
- Translational Neuromodeling Unit (TNU), Institute for Biomedical Engineering, University of Zurich and ETH Zurich, Zurich, Switzerland
| | - Sudhir Raman
- Translational Neuromodeling Unit (TNU), Institute for Biomedical Engineering, University of Zurich and ETH Zurich, Zurich, Switzerland
| | - Dario Schöbi
- Translational Neuromodeling Unit (TNU), Institute for Biomedical Engineering, University of Zurich and ETH Zurich, Zurich, Switzerland
| | - Birte Toussaint
- Translational Neuromodeling Unit (TNU), Institute for Biomedical Engineering, University of Zurich and ETH Zurich, Zurich, Switzerland
| | - Lilian A. Weber
- Translational Neuromodeling Unit (TNU), Institute for Biomedical Engineering, University of Zurich and ETH Zurich, Zurich, Switzerland
| | - Yu Yao
- Translational Neuromodeling Unit (TNU), Institute for Biomedical Engineering, University of Zurich and ETH Zurich, Zurich, Switzerland
| | - Klaas E. Stephan
- Translational Neuromodeling Unit (TNU), Institute for Biomedical Engineering, University of Zurich and ETH Zurich, Zurich, Switzerland
| |
Collapse
|
29
|
Abstract
Experiments have implicated dopamine in model-based reinforcement learning (RL). These findings are unexpected as dopamine is thought to encode a reward prediction error (RPE), which is the key teaching signal in model-free RL. Here we examine two possible accounts for dopamine's involvement in model-based RL: the first that dopamine neurons carry a prediction error used to update a type of predictive state representation called a successor representation, the second that two well established aspects of dopaminergic activity, RPEs and surprise signals, can together explain dopamine's involvement in model-based RL.
Collapse
|
30
|
Lerner TN, Holloway AL, Seiler JL. Dopamine, Updated: Reward Prediction Error and Beyond. Curr Opin Neurobiol 2021; 67:123-130. [PMID: 33197709 PMCID: PMC8116345 DOI: 10.1016/j.conb.2020.10.012] [Citation(s) in RCA: 66] [Impact Index Per Article: 16.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2020] [Revised: 10/12/2020] [Accepted: 10/14/2020] [Indexed: 01/10/2023]
Abstract
Dopamine neurons have been intensely studied for their roles in reinforcement learning. A dominant theory of how these neurons contribute to learning is through the encoding of a reward prediction error (RPE) signal. Recent advances in dopamine research have added nuance to RPE theory by incorporating the ideas of sensory prediction error, distributional encoding, and belief states. Further nuance is likely to be added shortly by convergent lines of research on dopamine neuron diversity. Finally, a major challenge is to reconcile RPE theory with other current theories of dopamine function to account for dopamine's role in movement, motivation, and goal-directed planning.
Collapse
Affiliation(s)
- Talia N Lerner
- Feinberg School of Medicine and Department of Physiology, Northwestern University, Chicago, IL, USA; Northwestern University Interdepartmental Neuroscience Program, Chicago, IL, USA.
| | - Ashley L Holloway
- Feinberg School of Medicine and Department of Physiology, Northwestern University, Chicago, IL, USA; Northwestern University Interdepartmental Neuroscience Program, Chicago, IL, USA
| | - Jillian L Seiler
- Feinberg School of Medicine and Department of Physiology, Northwestern University, Chicago, IL, USA; Department of Psychology, University of Illinois at Chicago, Chicago, IL, USA
| |
Collapse
|
31
|
Speranza L, di Porzio U, Viggiano D, de Donato A, Volpicelli F. Dopamine: The Neuromodulator of Long-Term Synaptic Plasticity, Reward and Movement Control. Cells 2021; 10:735. [PMID: 33810328 PMCID: PMC8066851 DOI: 10.3390/cells10040735] [Citation(s) in RCA: 169] [Impact Index Per Article: 42.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2021] [Revised: 03/20/2021] [Accepted: 03/23/2021] [Indexed: 01/11/2023] Open
Abstract
Dopamine (DA) is a key neurotransmitter involved in multiple physiological functions including motor control, modulation of affective and emotional states, reward mechanisms, reinforcement of behavior, and selected higher cognitive functions. Dysfunction in dopaminergic transmission is recognized as a core alteration in several devastating neurological and psychiatric disorders, including Parkinson's disease (PD), schizophrenia, bipolar disorder, attention deficit hyperactivity disorder (ADHD) and addiction. Here we will discuss the current insights on the role of DA in motor control and reward learning mechanisms and its involvement in the modulation of synaptic dynamics through different pathways. In particular, we will consider the role of DA as neuromodulator of two forms of synaptic plasticity, known as long-term potentiation (LTP) and long-term depression (LTD) in several cortical and subcortical areas. Finally, we will delineate how the effect of DA on dendritic spines places this molecule at the interface between the motor and the cognitive systems. Specifically, we will be focusing on PD, vascular dementia, and schizophrenia.
Collapse
Affiliation(s)
- Luisa Speranza
- Dominick P. Purpura Department of Neuroscience, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, NY 10461, USA;
| | - Umberto di Porzio
- Institute of Genetics and Biophysics “Adriano Buzzati Traverso”, CNR, 80131 Naples, Italy
| | - Davide Viggiano
- Department of Translational Medical Sciences, Genetic Research Institute “Gaetano Salvatore”, University of Campania “L. Vanvitelli”, IT and Biogem S.c.a.r.l., 83031 Ariano Irpino, Italy; (D.V.); (A.d.D.)
| | - Antonio de Donato
- Department of Translational Medical Sciences, Genetic Research Institute “Gaetano Salvatore”, University of Campania “L. Vanvitelli”, IT and Biogem S.c.a.r.l., 83031 Ariano Irpino, Italy; (D.V.); (A.d.D.)
| | - Floriana Volpicelli
- Department of Pharmacy, School of Medicine and Surgery, University of Naples Federico II, 80131 Naples, Italy;
| |
Collapse
|
32
|
Howard JD, Kahnt T. To be specific: The role of orbitofrontal cortex in signaling reward identity. Behav Neurosci 2021; 135:210-217. [PMID: 33734730 DOI: 10.1037/bne0000455] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
The orbitofrontal cortex (OFC) plays a prominent role in signaling reward expectations. Two important features of rewards are their value (how good they are) and their specific identity (what they are). Whereas research on OFC has traditionally focused on reward value, recent findings point toward a pivotal role of reward identity in understanding OFC signaling and its contribution to behavior. Here, we review work in rodents, nonhuman primates, and humans on how the OFC represents expectations about the identity of rewards, and how these signals contribute to outcome-guided behavior. Moreover, we summarize recent findings suggesting that specific reward expectations in OFC are learned and updated by means of identity errors in the dopaminergic midbrain. We conclude by discussing how OFC encoding of specific rewards complements recent proposals that this region represents a cognitive map of relevant task states, which forms the basis for model-based behavior. (PsycInfo Database Record (c) 2021 APA, all rights reserved).
Collapse
|
33
|
Abstract
Importance The tools and insights of behavioral neuroscience grow apace, yet their clinical application is lagging. Observations This article suggests that associative learning theory may be the algorithmic bridge to connect a burgeoning understanding of the brain with the challenges to the mind with which all clinicians and researchers are concerned. Conclusions and Relevance Instead of giving up, talking past one another, or resting on the laurels of face validity, a consilient and collaborative approach is suggested: visiting laboratory meetings and clinical rounds and attempting to converse in the language of behavior and cognition to better understand and ultimately treat patients.
Collapse
Affiliation(s)
- Philip R Corlett
- Clinical Neuroscience Research Unit, Department of Psychiatry, Yale University, School of Medicine, New Haven, Connecticut
| | - Geoffrey Schoenbaum
- National Institute on Drug Abuse Intramural Research Program, Baltimore, Maryland
| |
Collapse
|
34
|
Kim HR, Malik AN, Mikhael JG, Bech P, Tsutsui-Kimura I, Sun F, Zhang Y, Li Y, Watabe-Uchida M, Gershman SJ, Uchida N. A Unified Framework for Dopamine Signals across Timescales. Cell 2020; 183:1600-1616.e25. [PMID: 33248024 PMCID: PMC7736562 DOI: 10.1016/j.cell.2020.11.013] [Citation(s) in RCA: 157] [Impact Index Per Article: 31.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2019] [Revised: 08/20/2020] [Accepted: 11/09/2020] [Indexed: 01/06/2023]
Abstract
Rapid phasic activity of midbrain dopamine neurons is thought to signal reward prediction errors (RPEs), resembling temporal difference errors used in machine learning. However, recent studies describing slowly increasing dopamine signals have instead proposed that they represent state values and arise independent from somatic spiking activity. Here we developed experimental paradigms using virtual reality that disambiguate RPEs from values. We examined dopamine circuit activity at various stages, including somatic spiking, calcium signals at somata and axons, and striatal dopamine concentrations. Our results demonstrate that ramping dopamine signals are consistent with RPEs rather than value, and this ramping is observed at all stages examined. Ramping dopamine signals can be driven by a dynamic stimulus that indicates a gradual approach to a reward. We provide a unified computational understanding of rapid phasic and slowly ramping dopamine signals: dopamine neurons perform a derivative-like computation over values on a moment-by-moment basis.
Collapse
Affiliation(s)
- HyungGoo R Kim
- Center for Brain Science, Department of Molecular and Cellular Biology, Harvard University, 16 Divinity Avenue, Cambridge, MA 02138, USA.
| | - Athar N Malik
- Center for Brain Science, Department of Molecular and Cellular Biology, Harvard University, 16 Divinity Avenue, Cambridge, MA 02138, USA; Department of Neurosurgery, Massachusetts General Hospital, 55 Fruit Street, Boston, MA 02114, USA
| | - John G Mikhael
- Program in Neuroscience, Harvard Medical School, 220 Longwood Avenue, Boston, MA 02115, USA; MD-PhD Program, Harvard Medical School, 260 Longwood Avenue, Boston, MA 02115, USA
| | - Pol Bech
- Center for Brain Science, Department of Molecular and Cellular Biology, Harvard University, 16 Divinity Avenue, Cambridge, MA 02138, USA
| | - Iku Tsutsui-Kimura
- Center for Brain Science, Department of Molecular and Cellular Biology, Harvard University, 16 Divinity Avenue, Cambridge, MA 02138, USA
| | - Fangmiao Sun
- State Key Laboratory of Membrane Biology, Peking University School of Life Sciences, Beijing 100871, China; Peking-Tsinghua Center for Life Sciences, Beijing 100871, China; PKU-IDG/McGovern Institute for Brain Research, Beijing 100871, China
| | - Yajun Zhang
- State Key Laboratory of Membrane Biology, Peking University School of Life Sciences, Beijing 100871, China; Peking-Tsinghua Center for Life Sciences, Beijing 100871, China; PKU-IDG/McGovern Institute for Brain Research, Beijing 100871, China
| | - Yulong Li
- State Key Laboratory of Membrane Biology, Peking University School of Life Sciences, Beijing 100871, China; Peking-Tsinghua Center for Life Sciences, Beijing 100871, China; PKU-IDG/McGovern Institute for Brain Research, Beijing 100871, China
| | - Mitsuko Watabe-Uchida
- Center for Brain Science, Department of Molecular and Cellular Biology, Harvard University, 16 Divinity Avenue, Cambridge, MA 02138, USA
| | - Samuel J Gershman
- Department of Psychology, Center for Brain Science, Harvard University, 52 Oxford Street, Cambridge, MA 02138, USA
| | - Naoshige Uchida
- Center for Brain Science, Department of Molecular and Cellular Biology, Harvard University, 16 Divinity Avenue, Cambridge, MA 02138, USA.
| |
Collapse
|
35
|
Iglesias S, Kasper L, Harrison SJ, Manka R, Mathys C, Stephan KE. Cholinergic and dopaminergic effects on prediction error and uncertainty responses during sensory associative learning. Neuroimage 2020; 226:117590. [PMID: 33285332 DOI: 10.1016/j.neuroimage.2020.117590] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2020] [Revised: 10/20/2020] [Accepted: 11/19/2020] [Indexed: 01/11/2023] Open
Abstract
Navigating the physical world requires learning probabilistic associations between sensory events and their change in time (volatility). Bayesian accounts of this learning process rest on hierarchical prediction errors (PEs) that are weighted by estimates of uncertainty (or its inverse, precision). In a previous fMRI study we found that low-level precision-weighted PEs about visual outcomes (that update beliefs about associations) activated the putative dopaminergic midbrain; by contrast, precision-weighted PEs about cue-outcome associations (that update beliefs about volatility) activated the cholinergic basal forebrain. These findings suggested selective dopaminergic and cholinergic influences on precision-weighted PEs at different hierarchical levels. Here, we tested this hypothesis, repeating our fMRI study under pharmacological manipulations in healthy participants. Specifically, we performed two pharmacological fMRI studies with a between-subject double-blind placebo-controlled design: study 1 used antagonists of dopaminergic (amisulpride) and muscarinic (biperiden) receptors, study 2 used enhancing drugs of dopaminergic (levodopa) and cholinergic (galantamine) modulation. Pooled across all pharmacological conditions of study 1 and study 2, respectively, we found that low-level precision-weighted PEs activated the midbrain and high-level precision-weighted PEs the basal forebrain as in our previous study. However, we found pharmacological effects on brain activity associated with these computational quantities only when splitting the precision-weighted PEs into their PE and precision components: in a brainstem region putatively containing cholinergic (pedunculopontine and laterodorsal tegmental) nuclei, biperiden (compared to placebo) enhanced low-level PE responses and attenuated high-level PE activity, while amisulpride reduced high-level PE responses. Additionally, in the putative dopaminergic midbrain, galantamine compared to placebo enhanced low-level PE responses (in a body-weight dependent manner) and amisulpride enhanced high-level precision activity. Task behaviour was not affected by any of the drugs. These results do not support our hypothesis of a clear-cut dichotomy between different hierarchical inference levels and neurotransmitter systems, but suggest a more complex interaction between these neuromodulatory systems and hierarchical Bayesian quantities. However, our present results may have been affected by confounds inherent to pharmacological fMRI. We discuss these confounds and outline improved experimental tests for the future.
Collapse
Affiliation(s)
- Sandra Iglesias
- Translational Neuromodeling Unit (TNU), Institute for Biomedical Engineering, University of Zurich & Swiss Federal Institute of Technology (ETH Zurich), Wilfriedstr. 6, 8032 Zurich, Switzerland.
| | - Lars Kasper
- Translational Neuromodeling Unit (TNU), Institute for Biomedical Engineering, University of Zurich & Swiss Federal Institute of Technology (ETH Zurich), Wilfriedstr. 6, 8032 Zurich, Switzerland; Institute for Biomedical Engineering, ETH Zurich and University of Zurich, Switzerland
| | - Samuel J Harrison
- Translational Neuromodeling Unit (TNU), Institute for Biomedical Engineering, University of Zurich & Swiss Federal Institute of Technology (ETH Zurich), Wilfriedstr. 6, 8032 Zurich, Switzerland
| | - Robert Manka
- Department of Cardiology, University Hospital Zurich, Switzerland
| | - Christoph Mathys
- Translational Neuromodeling Unit (TNU), Institute for Biomedical Engineering, University of Zurich & Swiss Federal Institute of Technology (ETH Zurich), Wilfriedstr. 6, 8032 Zurich, Switzerland; Interacting Minds Centre, Aarhus University, Aarhus, Denmark
| | - Klaas E Stephan
- Translational Neuromodeling Unit (TNU), Institute for Biomedical Engineering, University of Zurich & Swiss Federal Institute of Technology (ETH Zurich), Wilfriedstr. 6, 8032 Zurich, Switzerland; Max Planck Institute for Metabolism Research, Cologne, Germany
| |
Collapse
|
36
|
Barron HC, Reeve HM, Koolschijn RS, Perestenko PV, Shpektor A, Nili H, Rothaermel R, Campo-Urriza N, O'Reilly JX, Bannerman DM, Behrens TEJ, Dupret D. Neuronal Computation Underlying Inferential Reasoning in Humans and Mice. Cell 2020; 183:228-243.e21. [PMID: 32946810 PMCID: PMC7116148 DOI: 10.1016/j.cell.2020.08.035] [Citation(s) in RCA: 95] [Impact Index Per Article: 19.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2019] [Revised: 05/10/2020] [Accepted: 08/18/2020] [Indexed: 12/17/2022]
Abstract
Every day we make decisions critical for adaptation and survival. We repeat actions with known consequences. But we also draw on loosely related events to infer and imagine the outcome of entirely novel choices. These inferential decisions are thought to engage a number of brain regions; however, the underlying neuronal computation remains unknown. Here, we use a multi-day cross-species approach in humans and mice to report the functional anatomy and neuronal computation underlying inferential decisions. We show that during successful inference, the mammalian brain uses a hippocampal prospective code to forecast temporally structured learned associations. Moreover, during resting behavior, coactivation of hippocampal cells in sharp-wave/ripples represent inferred relationships that include reward, thereby "joining-the-dots" between events that have not been observed together but lead to profitable outcomes. Computing mnemonic links in this manner may provide an important mechanism to build a cognitive map that stretches beyond direct experience, thus supporting flexible behavior.
Collapse
Affiliation(s)
- Helen C Barron
- Medical Research Council Brain Network Dynamics Unit, Nuffield Department of Clinical Neurosciences, University of Oxford, Mansfield Road, Oxford OX1 3TH, UK; Wellcome Centre for Integrative Neuroimaging, University of Oxford, FMRIB, John Radcliffe Hospital, Oxford OX3 9DU, UK.
| | - Hayley M Reeve
- Medical Research Council Brain Network Dynamics Unit, Nuffield Department of Clinical Neurosciences, University of Oxford, Mansfield Road, Oxford OX1 3TH, UK
| | - Renée S Koolschijn
- Wellcome Centre for Integrative Neuroimaging, University of Oxford, FMRIB, John Radcliffe Hospital, Oxford OX3 9DU, UK
| | - Pavel V Perestenko
- Medical Research Council Brain Network Dynamics Unit, Nuffield Department of Clinical Neurosciences, University of Oxford, Mansfield Road, Oxford OX1 3TH, UK
| | - Anna Shpektor
- Wellcome Centre for Integrative Neuroimaging, University of Oxford, FMRIB, John Radcliffe Hospital, Oxford OX3 9DU, UK
| | - Hamed Nili
- Wellcome Centre for Integrative Neuroimaging, University of Oxford, FMRIB, John Radcliffe Hospital, Oxford OX3 9DU, UK
| | - Roman Rothaermel
- Medical Research Council Brain Network Dynamics Unit, Nuffield Department of Clinical Neurosciences, University of Oxford, Mansfield Road, Oxford OX1 3TH, UK
| | - Natalia Campo-Urriza
- Medical Research Council Brain Network Dynamics Unit, Nuffield Department of Clinical Neurosciences, University of Oxford, Mansfield Road, Oxford OX1 3TH, UK
| | - Jill X O'Reilly
- Wellcome Centre for Integrative Neuroimaging, University of Oxford, FMRIB, John Radcliffe Hospital, Oxford OX3 9DU, UK; Department of Experimental Psychology, University of Oxford, 15 Parks Road, Oxford OX1 3AQ, UK
| | - David M Bannerman
- Department of Experimental Psychology, University of Oxford, 15 Parks Road, Oxford OX1 3AQ, UK
| | - Timothy E J Behrens
- Wellcome Centre for Integrative Neuroimaging, University of Oxford, FMRIB, John Radcliffe Hospital, Oxford OX3 9DU, UK; The Wellcome Trust Centre for Neuroimaging, Institute of Neurology, University College London, London WC1N 3BG, UK
| | - David Dupret
- Medical Research Council Brain Network Dynamics Unit, Nuffield Department of Clinical Neurosciences, University of Oxford, Mansfield Road, Oxford OX1 3TH, UK.
| |
Collapse
|
37
|
Context-Dependent Multiplexing by Individual VTA Dopamine Neurons. J Neurosci 2020; 40:7489-7509. [PMID: 32859713 DOI: 10.1523/jneurosci.0502-20.2020] [Citation(s) in RCA: 39] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2020] [Revised: 07/25/2020] [Accepted: 08/03/2020] [Indexed: 01/13/2023] Open
Abstract
Dopamine (DA) neurons of the VTA track cues and rewards to generate a reward prediction error signal during Pavlovian conditioning. Here we explored how these neurons respond to a self-paced, operant task in freely moving mice. The animal could trigger a reward-predicting cue by remaining in a specific location of an operant box for a brief time before moving to a spout for reward collection. VTA DA neurons were identified using DAT-Cre male mice that carried an optrode with minimal impact on the behavioral task. In vivo single-unit recordings revealed transient fast spiking responses to the cue and reward in correct trials, while for incorrect ones the activity paused, reflecting positive and negative error signals of a reward prediction. In parallel, a majority of VTA DA neurons simultaneously encoded multiple actions (e.g., movement velocity, acceleration, distance to goal, and licking) in sustained slow firing modulation. Applying a GLM, we show that such multiplexed encoding of rewarding and motor variables by individual DA neurons was only apparent while the mouse was engaged in the task. Downstream targets may exploit such goal-directed multiplexing of VTA DA neurons to adjust actions to optimize the task's outcome.SIGNIFICANCE STATEMENT VTA DA neurons code for multiple functions, including the reward prediction error but also motivation and locomotion. Here we show that about half of the recorded VTA DA neurons perform multiplexing: they exploit the phasic and tonic activity modes to encode, respectively, the cue/reward responses and motor parameters, most prominently when the mouse engages in a self-paced operand task. VTA non-DA neurons, by contrast, encode motor parameters regardless of task engagement.
Collapse
|
38
|
Abstract
Humans and animals navigate uncertain environments by seeking information about the future. Remarkably, we often seek information even when it has no instrumental value for aiding our decisions - as if the information is a source of value in its own right. In recent years, there has been a flourishing of research into these non-instrumental information preferences and their implementation in the brain. Individuals value information about uncertain future rewards, and do so for multiple reasons, including valuing resolution of uncertainty and overweighting desirable information. The brain motivates this information seeking by tapping into some of the same circuitry as primary rewards like food and water. However, it also employs cortex and basal ganglia circuitry that predicts and values information as distinct from primary reward. Uncovering how these circuits cooperate will be fundamental to understanding information seeking and motivated behavior as a whole, in our increasingly complex and information-rich world.
Collapse
Affiliation(s)
| | - Ilya E Monosov
- Department of Neuroscience, Washington University School of Medicine, St. Louis, MO, USA.,Department of Biomedical Engineering, Washington University, St. Louis, MO, USA.,Department of Neurosurgery, Washington University, St. Louis, MO, USA.,Pain Center, Washington University, St. Louis, MO, USA
| |
Collapse
|
39
|
Collins AL, Saunders BT. Heterogeneity in striatal dopamine circuits: Form and function in dynamic reward seeking. J Neurosci Res 2020; 98:1046-1069. [PMID: 32056298 PMCID: PMC7183907 DOI: 10.1002/jnr.24587] [Citation(s) in RCA: 62] [Impact Index Per Article: 12.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2019] [Revised: 01/08/2020] [Accepted: 01/16/2020] [Indexed: 01/03/2023]
Abstract
The striatal dopamine system has long been studied in the context of reward learning, motivation, and movement. Given the prominent role dopamine plays in a variety of adaptive behavioral states, as well as diseases like addiction, it is essential to understand the full complexity of dopamine neurons and the striatal systems they target. A growing number of studies are uncovering details of the heterogeneity in dopamine neuron subpopulations. Here, we review that work to synthesize current understanding of dopamine system heterogeneity across three levels, anatomical organization, functions in behavior, and modes of action, wherein we focus on signaling profiles and local mechanisms for modulation of dopamine release. Together, these studies reveal new and emerging dimensions of the striatal dopamine system, informing its contribution to dynamic motivational and decision-making processes.
Collapse
Affiliation(s)
- Anne L. Collins
- University of Minnesota, Department of Neuroscience, Medical Discovery Team on Addiction, Minneapolis, MN 55455
| | - Benjamin T. Saunders
- University of Minnesota, Department of Neuroscience, Medical Discovery Team on Addiction, Minneapolis, MN 55455
| |
Collapse
|