1
|
Greenstreet F, Vergara HM, Johansson Y, Pati S, Schwarz L, Lenzi SC, Geerts JP, Wisdom M, Gubanova A, Rollik LB, Kaur J, Moskovitz T, Cohen J, Thompson E, Margrie TW, Clopath C, Stephenson-Jones M. Dopaminergic action prediction errors serve as a value-free teaching signal. Nature 2025:10.1038/s41586-025-09008-9. [PMID: 40369067 DOI: 10.1038/s41586-025-09008-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2024] [Accepted: 04/10/2025] [Indexed: 05/16/2025]
Abstract
Choice behaviour of animals is characterized by two main tendencies: taking actions that led to rewards and repeating past actions1,2. Theory suggests that these strategies may be reinforced by different types of dopaminergic teaching signals: reward prediction error to reinforce value-based associations and movement-based action prediction errors to reinforce value-free repetitive associations3-6. Here we use an auditory discrimination task in mice to show that movement-related dopamine activity in the tail of the striatum encodes the hypothesized action prediction error signal. Causal manipulations reveal that this prediction error serves as a value-free teaching signal that supports learning by reinforcing repeated associations. Computational modelling and experiments demonstrate that action prediction errors alone cannot support reward-guided learning, but when paired with the reward prediction error circuitry they serve to consolidate stable sound-action associations in a value-free manner. Together we show that there are two types of dopaminergic prediction errors that work in tandem to support learning, each reinforcing different types of association in different striatal areas.
Collapse
Affiliation(s)
- Francesca Greenstreet
- Sainsbury Wellcome Centre for Neural Circuits and Behaviour, University College London, London, UK
| | - Hernando Martinez Vergara
- Sainsbury Wellcome Centre for Neural Circuits and Behaviour, University College London, London, UK
- Institut d'Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS), Barcelona, Spain
| | - Yvonne Johansson
- Sainsbury Wellcome Centre for Neural Circuits and Behaviour, University College London, London, UK
| | - Sthitapranjya Pati
- Sainsbury Wellcome Centre for Neural Circuits and Behaviour, University College London, London, UK
| | - Laura Schwarz
- Sainsbury Wellcome Centre for Neural Circuits and Behaviour, University College London, London, UK
| | - Stephen C Lenzi
- Sainsbury Wellcome Centre for Neural Circuits and Behaviour, University College London, London, UK
| | - Jesse P Geerts
- Sainsbury Wellcome Centre for Neural Circuits and Behaviour, University College London, London, UK
- Bioengineering Department, Imperial College London, London, UK
| | - Matthew Wisdom
- Sainsbury Wellcome Centre for Neural Circuits and Behaviour, University College London, London, UK
| | - Alina Gubanova
- Sainsbury Wellcome Centre for Neural Circuits and Behaviour, University College London, London, UK
| | - Lars B Rollik
- Sainsbury Wellcome Centre for Neural Circuits and Behaviour, University College London, London, UK
| | - Jasvin Kaur
- Sainsbury Wellcome Centre for Neural Circuits and Behaviour, University College London, London, UK
| | - Theodore Moskovitz
- Gatsby Computational Neuroscience Unit, University College London, London, UK
| | - Joseph Cohen
- Sainsbury Wellcome Centre for Neural Circuits and Behaviour, University College London, London, UK
| | - Emmett Thompson
- Sainsbury Wellcome Centre for Neural Circuits and Behaviour, University College London, London, UK
| | - Troy W Margrie
- Sainsbury Wellcome Centre for Neural Circuits and Behaviour, University College London, London, UK
| | - Claudia Clopath
- Sainsbury Wellcome Centre for Neural Circuits and Behaviour, University College London, London, UK
- Bioengineering Department, Imperial College London, London, UK
| | - Marcus Stephenson-Jones
- Sainsbury Wellcome Centre for Neural Circuits and Behaviour, University College London, London, UK.
| |
Collapse
|
2
|
Wang H, Ortega HK, Kelly EB, Indajang J, Savalia NK, Glaeser-Khan S, Feng J, Li Y, Kaye AP, Kwan AC. Frontal noradrenergic and cholinergic transients exhibit distinct spatiotemporal dynamics during competitive decision-making. SCIENCE ADVANCES 2025; 11:eadr9916. [PMID: 40138407 PMCID: PMC11939063 DOI: 10.1126/sciadv.adr9916] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/24/2024] [Accepted: 02/20/2025] [Indexed: 03/29/2025]
Abstract
Norepinephrine (NE) and acetylcholine (ACh) are crucial for learning and decision-making. In the cortex, NE and ACh are released transiently at specific sites along neuromodulatory axons, but how the spatiotemporal patterns of NE and ACh signaling link to behavioral events is unknown. Here, we use two-photon microscopy to visualize neuromodulatory signals in the premotor cortex (medial M2) as mice engage in a competitive matching pennies game. Spatially, NE signals are more segregated with choice and outcome encoded at distinct locations, whereas ACh signals can multiplex and reflect different behavioral correlates at the same site. Temporally, task-driven NE transients were more synchronized and peaked earlier than ACh transients. To test functional relevance, we stimulated neuromodulatory signals using optogenetics to find that NE, but not ACh, increases the animals' propensity to explore alternate options. Together, the results reveal distinct subcellular spatiotemporal patterns of ACh and NE transients during decision-making in mice.
Collapse
Affiliation(s)
- Hongli Wang
- Interdepartmental Neuroscience Program, Yale University School of Medicine, New Haven, CT 06511, USA
| | - Heather K. Ortega
- Interdepartmental Neuroscience Program, Yale University School of Medicine, New Haven, CT 06511, USA
| | - Emma B. Kelly
- Department of Psychiatry, Yale University School of Medicine, New Haven, CT 06511, USA
- Meinig School of Biomedical Engineering, Cornell University, Ithaca, NY 14853, USA
| | - Jonathan Indajang
- Meinig School of Biomedical Engineering, Cornell University, Ithaca, NY 14853, USA
| | - Neil K. Savalia
- Interdepartmental Neuroscience Program, Yale University School of Medicine, New Haven, CT 06511, USA
- Meinig School of Biomedical Engineering, Cornell University, Ithaca, NY 14853, USA
- Medical Scientist Training Program, Yale University School of Medicine, New Haven, CT 06511, USA
| | - Samira Glaeser-Khan
- Department of Psychiatry, Yale University School of Medicine, New Haven, CT 06511, USA
| | - Jiesi Feng
- State Key Laboratory of Membrane Biology, Peking University School of Life Sciences, Beijing, China
| | - Yulong Li
- State Key Laboratory of Membrane Biology, Peking University School of Life Sciences, Beijing, China
- PKU-IDG/McGovern Institute for Brain Research, Beijing, China
- Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, China
- Chinese Institute for Brain Research, Beijing, China
| | - Alfred P. Kaye
- Department of Psychiatry, Yale University School of Medicine, New Haven, CT 06511, USA
- VA National Center for PTSD Clinical Neuroscience Division, West Haven, CT 06477, USA
- Wu Tsai Institute, New Haven, CT 06511, USA
| | - Alex C. Kwan
- Department of Psychiatry, Yale University School of Medicine, New Haven, CT 06511, USA
- Meinig School of Biomedical Engineering, Cornell University, Ithaca, NY 14853, USA
- Department of Psychiatry, Weill Cornell Medicine, New York, NY 10065, USA
| |
Collapse
|
3
|
Corlett PR, Fraser KM. 20 Years of Aberrant Salience in Psychosis: What Have We Learned? Am J Psychiatry 2025:appiajp20240556. [PMID: 40134268 DOI: 10.1176/appi.ajp.20240556] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 03/27/2025]
Abstract
Twenty years ago Shitij Kapur's "Psychosis as a state of aberrant salience" captured the attention of clinicians and cognitive and behavioral neuroscientists. It has become the de facto way of talking about delusion formation in labs and clinics. Here, evidence for this theory is critically evaluated in consideration of evolving data since its publication. A particular focus is placed on its specific predictions regarding the neural and behavioral loci of dopamine dysfunction in psychosis and finds them lacking. This examination is informed by recent advances in the understanding of the function of the dopamine system and its impacts on behavior following the explosion of new tools and probes for precise measurement and manipulation of dopaminergic circuits. Contemporary theories that have developed since Kapur-which suggest a role for dopamine in belief formation, belief updating under uncertainty, and abductive inference to the best explanation for some set of circumstances-are argued to form a more cogent theory that fits better with the work in patients with delusions and hallucinations, how they behave, and what is known about the function of their dopamine system. The original salience hypothesis has been influential as it attempted to unite neurochemical dysfunction with clinical phenomenology through computational cognitive neuroscience, which has led to the development of novel predictions that the authors highlight as future directions for the field.
Collapse
Affiliation(s)
- Philip R Corlett
- Wu Tsai Institute, Departments of Psychiatry and Psychology, Yale University, New Haven (Corlett); Department of Psychology, University of Minnesota, Minneapolis (Fraser)
| | - Kurt M Fraser
- Wu Tsai Institute, Departments of Psychiatry and Psychology, Yale University, New Haven (Corlett); Department of Psychology, University of Minnesota, Minneapolis (Fraser)
| |
Collapse
|
4
|
Qian L, Burrell M, Hennig JA, Matias S, Murthy VN, Gershman SJ, Uchida N. Prospective contingency explains behavior and dopamine signals during associative learning. Nat Neurosci 2025:10.1038/s41593-025-01915-4. [PMID: 40102680 DOI: 10.1038/s41593-025-01915-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2024] [Accepted: 02/06/2025] [Indexed: 03/20/2025]
Abstract
Associative learning depends on contingency, the degree to which a stimulus predicts an outcome. Despite its importance, the neural mechanisms linking contingency to behavior remain elusive. In the present study, we examined the dopamine activity in the ventral striatum-a signal implicated in associative learning-in a Pavlovian contingency degradation task in mice. We show that both anticipatory licking and dopamine responses to a conditioned stimulus decreased when additional rewards were delivered uncued, but remained unchanged if additional rewards were cued. These results conflict with contingency-based accounts using a traditional definition of contingency or a new causal learning model (ANCCR), but can be explained by temporal difference (TD) learning models equipped with an appropriate intertrial interval state representation. Recurrent neural networks trained within a TD framework develop state representations akin to our best 'handcrafted' model. Our findings suggest that the TD error can be a measure that describes both contingency and dopaminergic activity.
Collapse
Affiliation(s)
- Lechen Qian
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, MA, USA
- Center for Brain Science, Harvard University, Cambridge, MA, USA
| | - Mark Burrell
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, MA, USA
- Center for Brain Science, Harvard University, Cambridge, MA, USA
| | - Jay A Hennig
- Center for Brain Science, Harvard University, Cambridge, MA, USA
- Department of Psychology, Harvard University, Cambridge, MA, USA
| | - Sara Matias
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, MA, USA
- Center for Brain Science, Harvard University, Cambridge, MA, USA
| | - Venkatesh N Murthy
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, MA, USA
- Center for Brain Science, Harvard University, Cambridge, MA, USA
| | - Samuel J Gershman
- Center for Brain Science, Harvard University, Cambridge, MA, USA
- Department of Psychology, Harvard University, Cambridge, MA, USA
| | - Naoshige Uchida
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, MA, USA.
- Center for Brain Science, Harvard University, Cambridge, MA, USA.
| |
Collapse
|
5
|
Gershman SJ, Lak A. Policy Complexity Suppresses Dopamine Responses. J Neurosci 2025; 45:e1756242024. [PMID: 39788740 PMCID: PMC11866995 DOI: 10.1523/jneurosci.1756-24.2024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2024] [Revised: 12/17/2024] [Accepted: 12/23/2024] [Indexed: 01/12/2025] Open
Abstract
Limits on information processing capacity impose limits on task performance. We show that male and female mice achieve performance on a perceptual decision task that is near-optimal given their capacity limits, as measured by policy complexity (the mutual information between states and actions). This behavioral profile could be achieved by reinforcement learning with a penalty on high complexity policies, realized through modulation of dopaminergic learning signals. In support of this hypothesis, we find that policy complexity suppresses midbrain dopamine responses to reward outcomes. Furthermore, neural and behavioral reward sensitivity were positively correlated across sessions. Our results suggest that policy compression shapes basic mechanisms of reinforcement learning in the brain.
Collapse
Affiliation(s)
- Samuel J Gershman
- Department of Psychology and Center for Brain Science, Harvard University, Cambridge, Massachusetts 02138
| | - Armin Lak
- Department of Physiology, Anatomy and Genetics, University of Oxford, Oxford, United Kingdom
| |
Collapse
|
6
|
Ding M, Tomsick PL, Young RA, Jadhav SP. Ventral tegmental area dopamine neural activity switches simultaneously with rule representations in the prefrontal cortex and hippocampus. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2024.09.09.611811. [PMID: 39314328 PMCID: PMC11419070 DOI: 10.1101/2024.09.09.611811] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 09/25/2024]
Abstract
Multiple brain regions need to coordinate activity to support cognitive flexibility and behavioral adaptation. Neural activity in both the hippocampus (HPC) and prefrontal cortex (PFC) is known to represent spatial context and is sensitive to reward and rule alterations. Midbrain dopamine (DA) activity is key in reward seeking behavior and learning. There is abundant evidence that midbrain DA modulates HPC and PFC activity. However, it remains underexplored how these networks engage dynamically and coordinate temporally when animals must adjust their behavior according to changing reward contingencies. In particular, is there any relationship between DA reward prediction change during rule switching, and rule representation changes in PFC and CA1? We addressed these questions using simultaneous recording of neuronal population activity from the hippocampal area CA1, PFC and ventral tegmental area (VTA) in male TH-Cre rats performing two spatial working memory tasks with frequent rule switches in blocks of trials. CA1 and PFC ensembles showed rule-specific activity both during maze running and at reward locations, with PFC rule coding more consistent across animals compared to CA1. Optogenetically tagged VTA DA neuron firing activity responded to and predicted reward outcome. We found that the correct prediction in DA emerged gradually over trials after rule-switching in coordination with transitions in PFC and CA1 ensemble representations of the current rule after a rule switch, followed by behavioral adaptation to the correct rule sequence. Therefore, our study demonstrates a crucial temporal coordination between the rule representation in PFC/CA1, the dopamine reward signal and behavioral strategy.
Collapse
Affiliation(s)
- Mingxin Ding
- Graduate Program in Neuroscience, Brandeis University, Waltham, MA 02453, USA
| | - Porter L. Tomsick
- Undergraduate Program in Neuroscience, Brandeis University, Waltham, MA 02453, USA
- Department of Neuroscience, Virginia Polytechnic Institute and State University, Blacksburg, VA 24061, USA
| | - Ryan A. Young
- Department of Psychology, Brandeis University, Waltham, MA, 02453, USA
| | - Shantanu P. Jadhav
- Graduate Program in Neuroscience, Brandeis University, Waltham, MA 02453, USA
- Department of Psychology, Brandeis University, Waltham, MA, 02453, USA
- Volen National Center for Complex Systems, Brandeis University, Waltham, MA, 02453, USA
| |
Collapse
|
7
|
Mishchanchuk K, Gregoriou G, Qü A, Kastler A, Huys QJM, Wilbrecht L, MacAskill AF. Hidden state inference requires abstract contextual representations in the ventral hippocampus. Science 2024; 386:926-932. [PMID: 39571013 DOI: 10.1126/science.adq5874] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2024] [Accepted: 10/16/2024] [Indexed: 11/24/2024]
Abstract
The ability to use subjective, latent contextual representations to influence decision-making is crucial for everyday life. The hippocampus is hypothesized to bind together otherwise abstract combinations of stimuli to represent such latent contexts, to support the process of hidden state inference. Yet evidence for a role of the hippocampus in hidden state inference remains limited. We found that the ventral hippocampus is required for mice to perform hidden state inference during a two-armed bandit task. Hippocampal neurons differentiate the two abstract contexts required for this strategy in a manner similar to the differentiation of spatial locations, and their activity is essential for appropriate dopamine dynamics. These findings offer insight into how latent contextual information is used to optimize decisions, and they emphasize a key role for the hippocampus in hidden state inference.
Collapse
Affiliation(s)
- Karyna Mishchanchuk
- Department of Neuroscience, Physiology and Pharmacology, University College London, London, UK
| | - Gabrielle Gregoriou
- Department of Neuroscience, Physiology and Pharmacology, University College London, London, UK
| | - Albert Qü
- Helen Wills Institute of Neuroscience, Department of Psychology, University of California, Berkeley, CA, USA
- Center for Computational Biology, University of California, Berkeley, CA, USA
| | - Alizée Kastler
- Department of Neuroscience, Physiology and Pharmacology, University College London, London, UK
| | - Quentin J M Huys
- Applied Computational Psychiatry Lab, Mental Health Neuroscience Department, Division of Psychiatry and Max Planck UCL Centre for Computational Psychiatry and Ageing Research, Queen Square Institute of Neurology, University College London, London, UK
| | - Linda Wilbrecht
- Helen Wills Institute of Neuroscience, Department of Psychology, University of California, Berkeley, CA, USA
| | - Andrew F MacAskill
- Department of Neuroscience, Physiology and Pharmacology, University College London, London, UK
| |
Collapse
|
8
|
Lehmann K, Bolis D, Friston KJ, Schilbach L, Ramstead MJD, Kanske P. An Active-Inference Approach to Second-Person Neuroscience. PERSPECTIVES ON PSYCHOLOGICAL SCIENCE 2024; 19:931-951. [PMID: 37565656 PMCID: PMC11539477 DOI: 10.1177/17456916231188000] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/12/2023]
Abstract
Social neuroscience has often been criticized for approaching the investigation of the neural processes that enable social interaction and cognition from a passive, detached, third-person perspective, without involving any real-time social interaction. With the emergence of second-person neuroscience, investigators have uncovered the unique complexity of neural-activation patterns in actual, real-time interaction. Social cognition that occurs during social interaction is fundamentally different from that unfolding during social observation. However, it remains unclear how the neural correlates of social interaction are to be interpreted. Here, we leverage the active-inference framework to shed light on the mechanisms at play during social interaction in second-person neuroscience studies. Specifically, we show how counterfactually rich mutual predictions, real-time bodily adaptation, and policy selection explain activation in components of the default mode, salience, and frontoparietal networks of the brain, as well as in the basal ganglia. We further argue that these processes constitute the crucial neural processes that underwrite bona fide social interaction. By placing the experimental approach of second-person neuroscience on the theoretical foundation of the active-inference framework, we inform the field of social neuroscience about the mechanisms of real-life interactions. We thereby contribute to the theoretical foundations of empirical second-person neuroscience.
Collapse
Affiliation(s)
- Konrad Lehmann
- Clinical Psychology and Behavioral Neuroscience, Faculty of Psychology, Technische Universität Dresden, Germany
| | - Dimitris Bolis
- Laboratory for Autism and Neurodevelopmental Disorders, Center for Neuroscience and Cognitive Systems @UniTn, Istituto Italiano di Tecnologia, Rovereto, Italy
- Independent Max Planck Research Group for Social Neuroscience, Max Planck Institute of Psychiatry, Munich, Germany
- National Institute for Physiological Sciences, Okazaki, Japan
- Centre for Philosophy of Science, University of Lisbon, Portugal
| | - Karl J. Friston
- Wellcome Centre for Human Neuroimaging, University College London, UK
- VERSES AI Research Lab, Los Angeles, CA, USA
| | - Leonhard Schilbach
- Independent Max Planck Research Group for Social Neuroscience, Max Planck Institute of Psychiatry, Munich, Germany
- Department of Psychiatry and Psychotherapy, University Hospital, Ludwig Maximilians Universität, Munich, Germany
- Department of General Psychiatry 2, Clinics of the Heinrich Heine University Düsseldorf, Germany
| | - Maxwell J. D. Ramstead
- Wellcome Centre for Human Neuroimaging, University College London, UK
- VERSES AI Research Lab, Los Angeles, CA, USA
| | - Philipp Kanske
- Clinical Psychology and Behavioral Neuroscience, Faculty of Psychology, Technische Universität Dresden, Germany
| |
Collapse
|
9
|
Zhang Z, Takahashi YK, Montesinos-Cartegena M, Kahnt T, Langdon AJ, Schoenbaum G. Expectancy-related changes in firing of dopamine neurons depend on hippocampus. Nat Commun 2024; 15:8911. [PMID: 39414794 PMCID: PMC11484966 DOI: 10.1038/s41467-024-53308-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2023] [Accepted: 10/07/2024] [Indexed: 10/18/2024] Open
Abstract
The orbitofrontal cortex (OFC) and hippocampus (HC) both contribute to the cognitive maps that support flexible behavior. Previously, we used the dopamine neurons to measure the functional role of OFC. We recorded midbrain dopamine neurons as rats performed an odor-based choice task, in which expected rewards were manipulated across blocks. We found that ipsilateral OFC lesions degraded dopaminergic prediction errors, consistent with reduced resolution of the task states. Here we have repeated this experiment in male rats with ipsilateral HC lesions. The results show HC also shapes the task states, however unlike OFC, which provides information local to the trial, the HC is necessary for estimating upper-level hidden states that distinguish blocks. The results contrast the roles of the OFC and HC in cognitive mapping and suggest that the dopamine neurons access rich information from distributed regions regarding the environment's structure, potentially enabling this teaching signal to support complex behaviors.
Collapse
Affiliation(s)
- Zhewei Zhang
- Intramural Research Program, National Institute on Drug Abuse, Baltimore, MD, USA.
| | - Yuji K Takahashi
- Intramural Research Program, National Institute on Drug Abuse, Baltimore, MD, USA
| | | | - Thorsten Kahnt
- Intramural Research Program, National Institute on Drug Abuse, Baltimore, MD, USA
| | - Angela J Langdon
- Intramural Research Program, National Institute on Mental Health, Bethesda, MD, USA
| | - Geoffrey Schoenbaum
- Intramural Research Program, National Institute on Drug Abuse, Baltimore, MD, USA.
| |
Collapse
|
10
|
Chen W, Liang J, Wu Q, Han Y. Anterior cingulate cortex provides the neural substrates for feedback-driven iteration of decision and value representation. Nat Commun 2024; 15:6020. [PMID: 39019943 PMCID: PMC11255269 DOI: 10.1038/s41467-024-50388-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2023] [Accepted: 07/05/2024] [Indexed: 07/19/2024] Open
Abstract
Adjusting decision-making under uncertain and dynamic situations is the hallmark of intelligence. It requires a system capable of converting feedback information to renew the internal value. The anterior cingulate cortex (ACC) involves in error and reward events that prompt switching or maintenance of current decision strategies. However, it is unclear whether and how the changes of stimulus-action mapping during behavioral adaptation are encoded, nor how such computation drives decision adaptation. Here, we tracked ACC activity in male mice performing go/no-go auditory discrimination tasks with manipulated stimulus-reward contingencies. Individual ACC neurons integrate the outcome information to the value representation in the next-run trials. Dynamic recruitment of them determines the learning rate of error-guided value iteration and decision adaptation, forming a non-linear feedback-driven updating system to secure the appropriate decision switch. Optogenetically suppressing ACC significantly slowed down feedback-driven decision switching without interfering with the execution of the established strategy.
Collapse
Affiliation(s)
- Wenqi Chen
- Department of Neurobiology, School of Basic Medicine, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430030, China
| | - Jiejunyi Liang
- State Key Laboratory of Intelligent Manufacturing Equipment and Technology, Huazhong University of Science and Technology, Wuhan, 430074, China
| | - Qiyun Wu
- State Key Laboratory of Intelligent Manufacturing Equipment and Technology, Huazhong University of Science and Technology, Wuhan, 430074, China
| | - Yunyun Han
- Department of Neurobiology, School of Basic Medicine, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430030, China.
| |
Collapse
|
11
|
Zhou X, Wu W. Depth-based statistical analysis in the spike train space. J Appl Stat 2024; 52:329-355. [PMID: 39926174 PMCID: PMC11800346 DOI: 10.1080/02664763.2024.2369954] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2023] [Accepted: 06/10/2024] [Indexed: 02/11/2025]
Abstract
Metric-based summary statistics such as mean and covariance have been introduced in neural spike train space. They can properly describe template and variability in spike train data, but are often sensitive to outliers and expensive to compute. Recent studies also examine outlier detection and classification methods on point processes. These tools provide reasonable result, whereas the accuracy remains at a low level in certain cases. In this study, we propose to adopt a well-established notion of statistical depth to the spike train space. This framework can naturally define the median in a set of spike trains, which provides a robust description of the 'template' of the observations. It also provides a principled method to identify 'outliers' and classify data from different categories. We systematically compare the new median, outlier detection and classification tools with state-of-the-art competing methods. The result shows the median has superior description for template than the mean. Moreover, the proposed outlier detection and classification perform more accurately than previous methods.
Collapse
Affiliation(s)
- Xinyu Zhou
- Statistics Department, Florida State University, Tallahassee, FL, USA
| | - Wei Wu
- Statistics Department, Florida State University, Tallahassee, FL, USA
| |
Collapse
|
12
|
Katayama R, Shiraki R, Ishii S, Yoshida W. Belief inference for hierarchical hidden states in spatial navigation. Commun Biol 2024; 7:614. [PMID: 38773301 PMCID: PMC11109253 DOI: 10.1038/s42003-024-06316-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2023] [Accepted: 05/10/2024] [Indexed: 05/23/2024] Open
Abstract
Uncertainty abounds in the real world, and in environments with multiple layers of unobservable hidden states, decision-making requires resolving uncertainties based on mutual inference. Focusing on a spatial navigation problem, we develop a Tiger maze task that involved simultaneously inferring the local hidden state and the global hidden state from probabilistically uncertain observation. We adopt a Bayesian computational approach by proposing a hierarchical inference model. Applying this to human task behaviour, alongside functional magnetic resonance brain imaging, allows us to separate the neural correlates associated with reinforcement and reassessment of belief in hidden states. The imaging results also suggest that different layers of uncertainty differentially involve the basal ganglia and dorsomedial prefrontal cortex, and that the regions responsible are organised along the rostral axis of these areas according to the type of inference and the level of abstraction of the hidden state, i.e. higher-order state inference involves more anterior parts.
Collapse
Affiliation(s)
- Risa Katayama
- Graduate School of Informatics, Kyoto University, Kyoto, 606-8501, Japan.
- Department of AI-Brain Integration, Advanced Telecommunications Research Institute International, Kyoto, 619-0288, Japan.
| | - Ryo Shiraki
- Graduate School of Informatics, Kyoto University, Kyoto, 606-8501, Japan
| | - Shin Ishii
- Graduate School of Informatics, Kyoto University, Kyoto, 606-8501, Japan
- Neural Information Analysis Laboratories, Advanced Telecommunications Research Institute International, Kyoto, 619-0288, Japan
- International Research Center for Neurointelligence, the University of Tokyo, Tokyo, 113-0033, Japan
| | - Wako Yoshida
- Department of Neural Computation for Decision-Making, Advanced Telecommunications Research Institute International, Kyoto, 619-0288, Japan
- Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, OX3 9DU, UK
| |
Collapse
|
13
|
Pereira-Obilinovic U, Hou H, Svoboda K, Wang XJ. Brain mechanism of foraging: Reward-dependent synaptic plasticity versus neural integration of values. Proc Natl Acad Sci U S A 2024; 121:e2318521121. [PMID: 38551832 PMCID: PMC10998608 DOI: 10.1073/pnas.2318521121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2023] [Accepted: 01/16/2024] [Indexed: 04/02/2024] Open
Abstract
During foraging behavior, action values are persistently encoded in neural activity and updated depending on the history of choice outcomes. What is the neural mechanism for action value maintenance and updating? Here, we explore two contrasting network models: synaptic learning of action value versus neural integration. We show that both models can reproduce extant experimental data, but they yield distinct predictions about the underlying biological neural circuits. In particular, the neural integrator model but not the synaptic model requires that reward signals are mediated by neural pools selective for action alternatives and their projections are aligned with linear attractor axes in the valuation system. We demonstrate experimentally observable neural dynamical signatures and feasible perturbations to differentiate the two contrasting scenarios, suggesting that the synaptic model is a more robust candidate mechanism. Overall, this work provides a modeling framework to guide future experimental research on probabilistic foraging.
Collapse
Affiliation(s)
- Ulises Pereira-Obilinovic
- Center for Neural Science, New York University, New York, NY10003
- Allen Institute for Neural Dynamics, Seattle, WA98109
| | - Han Hou
- Allen Institute for Neural Dynamics, Seattle, WA98109
| | - Karel Svoboda
- Allen Institute for Neural Dynamics, Seattle, WA98109
| | - Xiao-Jing Wang
- Center for Neural Science, New York University, New York, NY10003
| |
Collapse
|
14
|
Gorrell S, Shott ME, Pryor T, Frank GKW. Neural Response to Expecting a Caloric Sweet Taste Stimulus Predicts Body Mass Index Longitudinally Among Young Adult Women With Anorexia Nervosa. BIOLOGICAL PSYCHIATRY. COGNITIVE NEUROSCIENCE AND NEUROIMAGING 2024; 9:298-304. [PMID: 37506848 PMCID: PMC10811282 DOI: 10.1016/j.bpsc.2023.07.007] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/18/2023] [Revised: 06/28/2023] [Accepted: 07/17/2023] [Indexed: 07/30/2023]
Abstract
BACKGROUND Anorexia nervosa (AN) is an often-chronic illness, and we lack biomarkers to predict long-term outcome. Recent neuroimaging studies using caloric taste stimuli suggest that paradigms that have tested conditioned neural responses to expectation or salient stimulus receipt may underpin behaviors. However, whether activation of those neural circuits can predict long-term outcome has not been studied. METHODS We followed women treated for AN (n = 35, mean age [SD] = 23 [7] years) and tested whether functional imaging brain response during a taste conditioning paradigm could predict posttreatment body mass index (BMI). We anticipated greater neural activity relative to caloric stimulus expectation and that dopamine-related receipt conditions would predict lower posttreatment BMI, indicating fear-associated arousal. RESULTS Follow-up occurred at mean (SD) = 1648 (1216) days after imaging. Stimulus expectation in orbitofrontal and striatal regions and BMI and BMI change at follow-up were negatively correlated, and these correlations remained significant for the right superior orbitofrontal cortex and BMI change after multiple comparison correction (r = -0.484, p = .003). This relationship remained significant after including time between brain scanning and follow-up in the model. Reward prediction error response did not predict long-term BMI. CONCLUSIONS The orbitofrontal cortex is involved in learning and conditioning, and these data implicate this region in learned caloric stimulus expectation and long-term prediction of weight outcomes in AN. Thus, conditioned elevated brain response to the anticipation of receiving a caloric stimulus may drive food avoidance, suggesting that breaking such associations is central for long-term recovery from AN.
Collapse
Affiliation(s)
- Sasha Gorrell
- Department of Psychiatry and Behavioral Sciences, University of California, San Francisco, San Francisco, California
| | - Megan E Shott
- Department of Psychiatry, University of California, San Diego, San Diego, California
| | | | - Guido K W Frank
- Department of Psychiatry, University of California, San Diego, San Diego, California.
| |
Collapse
|
15
|
Wilbrecht L, Davidow JY. Goal-directed learning in adolescence: neurocognitive development and contextual influences. Nat Rev Neurosci 2024; 25:176-194. [PMID: 38263216 DOI: 10.1038/s41583-023-00783-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/12/2023] [Indexed: 01/25/2024]
Abstract
Adolescence is a time during which we transition to independence, explore new activities and begin pursuit of major life goals. Goal-directed learning, in which we learn to perform actions that enable us to obtain desired outcomes, is central to many of these processes. Currently, our understanding of goal-directed learning in adolescence is itself in a state of transition, with the scientific community grappling with inconsistent results. When we examine metrics of goal-directed learning through the second decade of life, we find that many studies agree there are steady gains in performance in the teenage years, but others report that adolescent goal-directed learning is already adult-like, and some find adolescents can outperform adults. To explain the current variability in results, sophisticated experimental designs are being applied to test learning in different contexts. There is also increasing recognition that individuals of different ages and in different states will draw on different neurocognitive systems to support goal-directed learning. Through adoption of more nuanced approaches, we can be better prepared to recognize and harness adolescent strengths and to decipher the purpose (or goals) of adolescence itself.
Collapse
Affiliation(s)
- Linda Wilbrecht
- Department of Psychology, University of California, Berkeley, CA, USA.
- Helen Wills Neuroscience Institute, University of California, Berkeley, CA, USA.
| | - Juliet Y Davidow
- Department of Psychology, Northeastern University, Boston, MA, USA.
| |
Collapse
|
16
|
Fetcho RN, Parekh PK, Chou J, Kenwood M, Chalençon L, Estrin DJ, Johnson M, Liston C. A stress-sensitive frontostriatal circuit supporting effortful reward-seeking behavior. Neuron 2024; 112:473-487.e4. [PMID: 37963470 PMCID: PMC11533377 DOI: 10.1016/j.neuron.2023.10.020] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2022] [Revised: 07/06/2023] [Accepted: 10/18/2023] [Indexed: 11/16/2023]
Abstract
Effort valuation-a process for selecting actions based on the anticipated value of rewarding outcomes and expectations about the work required to obtain them-plays a fundamental role in decision-making. Effort valuation is disrupted in chronic stress states and is supported by the anterior cingulate cortex (ACC), but the circuit-level mechanisms by which the ACC regulates effort-based decision-making are unclear. Here, we show that ACC neurons projecting to the nucleus accumbens (ACC-NAc) play a critical role in effort valuation behavior in mice. Activity in ACC-NAc cells integrates both reward- and effort-related information, encoding a reward-related signal that scales with effort requirements and is necessary for supporting future effortful decisions. Chronic corticosterone exposure reduces motivation, suppresses effortful reward-seeking, and disrupts ACC-NAc signals. Together, our results delineate a stress-sensitive ACC-NAc circuit that supports effortful reward-seeking behavior by integrating reward and effort signals and reinforcing effort allocation in the service of maximizing reward.
Collapse
Affiliation(s)
- Robert N Fetcho
- Feil Family Brain and Mind Research Institute, Weill Cornell Medicine, New York, NY 10021, USA; Weill Cornell/Rockefeller/Sloan Kettering Tri-Institutional MD-PhD Program, Weill Cornell Medicine, New York, NY 10021, USA; Department of Psychiatry, Weill Cornell Medicine, New York, NY 10021, USA
| | - Puja K Parekh
- Department of Psychiatry, Weill Cornell Medicine, New York, NY 10021, USA
| | - Jolin Chou
- Department of Psychiatry, Weill Cornell Medicine, New York, NY 10021, USA
| | - Margaux Kenwood
- Department of Psychiatry, Weill Cornell Medicine, New York, NY 10021, USA
| | - Laura Chalençon
- Department of Psychiatry, Weill Cornell Medicine, New York, NY 10021, USA
| | - David J Estrin
- Feil Family Brain and Mind Research Institute, Weill Cornell Medicine, New York, NY 10021, USA; Department of Psychiatry, Weill Cornell Medicine, New York, NY 10021, USA
| | - Megan Johnson
- Department of Psychiatry, Weill Cornell Medicine, New York, NY 10021, USA
| | - Conor Liston
- Feil Family Brain and Mind Research Institute, Weill Cornell Medicine, New York, NY 10021, USA; Department of Psychiatry, Weill Cornell Medicine, New York, NY 10021, USA.
| |
Collapse
|
17
|
Qian L, Burrell M, Hennig JA, Matias S, Murthy VN, Gershman SJ, Uchida N. The role of prospective contingency in the control of behavior and dopamine signals during associative learning. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.02.05.578961. [PMID: 38370735 PMCID: PMC10871210 DOI: 10.1101/2024.02.05.578961] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/20/2024]
Abstract
Associative learning depends on contingency, the degree to which a stimulus predicts an outcome. Despite its importance, the neural mechanisms linking contingency to behavior remain elusive. Here we examined the dopamine activity in the ventral striatum - a signal implicated in associative learning - in a Pavlovian contingency degradation task in mice. We show that both anticipatory licking and dopamine responses to a conditioned stimulus decreased when additional rewards were delivered uncued, but remained unchanged if additional rewards were cued. These results conflict with contingency-based accounts using a traditional definition of contingency or a novel causal learning model (ANCCR), but can be explained by temporal difference (TD) learning models equipped with an appropriate inter-trial-interval (ITI) state representation. Recurrent neural networks trained within a TD framework develop state representations like our best 'handcrafted' model. Our findings suggest that the TD error can be a measure that describes both contingency and dopaminergic activity.
Collapse
Affiliation(s)
- Lechen Qian
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, MA, USA
- Center for Brain Science, Harvard University, Cambridge, MA, USA
- These authors contributed equally
| | - Mark Burrell
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, MA, USA
- Center for Brain Science, Harvard University, Cambridge, MA, USA
- These authors contributed equally
| | - Jay A. Hennig
- Center for Brain Science, Harvard University, Cambridge, MA, USA
- Department of Psychology, Harvard University, Cambridge, MA, USA
| | - Sara Matias
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, MA, USA
- Center for Brain Science, Harvard University, Cambridge, MA, USA
| | - Venkatesh. N. Murthy
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, MA, USA
- Center for Brain Science, Harvard University, Cambridge, MA, USA
| | - Samuel J. Gershman
- Center for Brain Science, Harvard University, Cambridge, MA, USA
- Department of Psychology, Harvard University, Cambridge, MA, USA
| | - Naoshige Uchida
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, MA, USA
- Center for Brain Science, Harvard University, Cambridge, MA, USA
| |
Collapse
|
18
|
Han S, Helmchen F. Behavior-relevant top-down cross-modal predictions in mouse neocortex. Nat Neurosci 2024; 27:298-308. [PMID: 38177341 DOI: 10.1038/s41593-023-01534-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2023] [Accepted: 11/27/2023] [Indexed: 01/06/2024]
Abstract
Animals adapt to a constantly changing world by predicting their environment and the consequences of their actions. The predictive coding hypothesis proposes that the brain generates predictions and continuously compares them with sensory inputs to guide behavior. However, how the brain reconciles conflicting top-down predictions and bottom-up sensory information remains unclear. To address this question, we simultaneously imaged neuronal populations in the mouse somatosensory barrel cortex and posterior parietal cortex during an auditory-cued texture discrimination task. In mice that had learned the task with fixed tone-texture matching, the presentation of mismatched pairing induced conflicts between tone-based texture predictions and actual texture inputs. When decisions were based on the predicted rather than the actual texture, top-down information flow was dominant and texture representations in both areas were modified, whereas dominant bottom-up information flow led to correct representations and behavioral choice. Our findings provide evidence for hierarchical predictive coding in the mouse neocortex.
Collapse
Affiliation(s)
- Shuting Han
- Brain Research Institute, University of Zurich, Zurich, Switzerland.
| | - Fritjof Helmchen
- Brain Research Institute, University of Zurich, Zurich, Switzerland.
- Neuroscience Center Zurich (ZNZ), University of Zurich, Zurich, Switzerland.
- University Research Priority Program (URPP), Adaptive Brain Circuits in Development and Learning, University of Zurich, Zurich, Switzerland.
| |
Collapse
|
19
|
Blanco-Pozo M, Akam T, Walton ME. Dopamine-independent effect of rewards on choices through hidden-state inference. Nat Neurosci 2024; 27:286-297. [PMID: 38216649 PMCID: PMC10849965 DOI: 10.1038/s41593-023-01542-x] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2021] [Accepted: 12/01/2023] [Indexed: 01/14/2024]
Abstract
Dopamine is implicated in adaptive behavior through reward prediction error (RPE) signals that update value estimates. There is also accumulating evidence that animals in structured environments can use inference processes to facilitate behavioral flexibility. However, it is unclear how these two accounts of reward-guided decision-making should be integrated. Using a two-step task for mice, we show that dopamine reports RPEs using value information inferred from task structure knowledge, alongside information about reward rate and movement. Nonetheless, although rewards strongly influenced choices and dopamine activity, neither activating nor inhibiting dopamine neurons at trial outcome affected future choice. These data were recapitulated by a neural network model where cortex learned to track hidden task states by predicting observations, while basal ganglia learned values and actions via RPEs. This shows that the influence of rewards on choices can stem from dopamine-independent information they convey about the world's state, not the dopaminergic RPEs they produce.
Collapse
Affiliation(s)
- Marta Blanco-Pozo
- Department of Experimental Psychology, Oxford University, Oxford, UK.
- Wellcome Centre for Integrative Neuroimaging, Oxford University, Oxford, UK.
| | - Thomas Akam
- Department of Experimental Psychology, Oxford University, Oxford, UK.
- Wellcome Centre for Integrative Neuroimaging, Oxford University, Oxford, UK.
| | - Mark E Walton
- Department of Experimental Psychology, Oxford University, Oxford, UK.
- Wellcome Centre for Integrative Neuroimaging, Oxford University, Oxford, UK.
| |
Collapse
|
20
|
Wang H, Ortega HK, Kelly EB, Indajang J, Feng J, Li Y, Kwan AC. Frontal noradrenergic and cholinergic transients exhibit distinct spatiotemporal dynamics during competitive decision-making. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.01.23.576893. [PMID: 38328186 PMCID: PMC10849696 DOI: 10.1101/2024.01.23.576893] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/09/2024]
Abstract
Norepinephrine (NE) and acetylcholine (ACh) are neuromodulators that are crucial for learning and decision-making. In the cortex, NE and ACh are released at specific sites along neuromodulatory axons, which would constrain their spatiotemporal dynamics at the subcellular scale. However, how the fluctuating patterns of NE and ACh signaling may be linked to behavioral events is unknown. Here, leveraging genetically encoded NE and ACh indicators, we use two-photon microscopy to visualize neuromodulatory signals in the superficial layer of the mouse medial frontal cortex during decision-making. Head-fixed mice engage in a competitive game called matching pennies against a computer opponent. We show that both NE and ACh transients carry information about decision-related variables including choice, outcome, and reinforcer. However, the two neuromodulators differ in their spatiotemporal pattern of task-related activation. Spatially, NE signals are more segregated with choice and outcome encoded at distinct locations, whereas ACh signals can multiplex and reflect different behavioral correlates at the same site. Temporally, task-driven NE transients were more synchronized and peaked earlier than ACh transients. To test functional relevance, using optogenetics we found that evoked elevation of NE, but not ACh, in the medial frontal cortex increases the propensity of the animals to switch and explore alternate options. Taken together, the results reveal distinct spatiotemporal patterns of rapid ACh and NE transients at the subcellular scale during decision-making in mice, which may endow these neuromodulators with different ways to impact neural plasticity to mediate learning and adaptive behavior.
Collapse
Affiliation(s)
- Hongli Wang
- Interdepartmental Neuroscience Program, Yale University School of Medicine, New Haven, Connecticut, 06511, USA
| | - Heather K. Ortega
- Interdepartmental Neuroscience Program, Yale University School of Medicine, New Haven, Connecticut, 06511, USA
| | - Emma B. Kelly
- Department of Psychiatry, Yale University School of Medicine, New Haven, Connecticut, 06511, USA
- Meinig School of Biomedical Engineering, Cornell University, Ithaca, New York, 14853, USA
| | - Jonathan Indajang
- Meinig School of Biomedical Engineering, Cornell University, Ithaca, New York, 14853, USA
| | - Jiesi Feng
- State Key Laboratory of Membrane Biology, Peking University School of Life Sciences, Beijing, China
| | - Yulong Li
- State Key Laboratory of Membrane Biology, Peking University School of Life Sciences, Beijing, China
- PKU-IDG/McGovern Institute for Brain Research, Beijing, China
- Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, China
- Chinese Institute for Brain Research, Beijing, China
| | - Alex C. Kwan
- Department of Psychiatry, Yale University School of Medicine, New Haven, Connecticut, 06511, USA
- Meinig School of Biomedical Engineering, Cornell University, Ithaca, New York, 14853, USA
- Department of Psychiatry, Weill Cornell Medicine, New York, New York, 10065, USA
| |
Collapse
|
21
|
Shikano Y, Yagishita S, Tanaka KF, Takata N. Slow-rising and fast-falling dopaminergic dynamics jointly adjust negative prediction error in the ventral striatum. Eur J Neurosci 2023; 58:4502-4522. [PMID: 36843200 DOI: 10.1111/ejn.15945] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2023] [Accepted: 02/22/2023] [Indexed: 02/28/2023]
Abstract
The greater the reward expectations are, the more different the brain's physiological response will be. Although it is well-documented that better-than-expected outcomes are encoded quantitatively via midbrain dopaminergic (DA) activity, it has been less addressed experimentally whether worse-than-expected outcomes are expressed quantitatively as well. We show that larger reward expectations upon unexpected reward omissions are associated with the preceding slower rise and following larger decrease (DA dip) in the DA concentration at the ventral striatum of mice. We set up a lever press task on a fixed ratio (FR) schedule requiring five lever presses as an effort for a food reward (FR5). The mice occasionally checked the food magazine without a reward before completing the task. The percentage of this premature magazine entry (PME) increased as the number of lever presses approached five, showing rising expectations with increasing proximity to task completion, and hence greater reward expectations. Fibre photometry of extracellular DA dynamics in the ventral striatum using a fluorescent protein (genetically encoded GPCR activation-based DA sensor: GRABDA2m ) revealed that the slow increase and fast decrease in DA levels around PMEs were correlated with the PME percentage, demonstrating a monotonic relationship between the DA dip amplitude and degree of expectations. Computational modelling of the lever press task implementing temporal difference errors and state transitions replicated the observed correlation between the PME frequency and DA dip amplitude in the FR5 task. Taken together, these findings indicate that the DA dip amplitude represents the degree of reward expectations monotonically, which may guide behavioural adjustment.
Collapse
Affiliation(s)
- Yu Shikano
- Division of Brain Sciences, Institute for Advanced Medical Research, Keio University School of Medicine, Tokyo, Japan
- Center for Disease Biology and Integrative Medicine, Faculty of Medicine, The University of Tokyo, Tokyo, Japan
| | - Sho Yagishita
- Center for Disease Biology and Integrative Medicine, Faculty of Medicine, The University of Tokyo, Tokyo, Japan
| | - Kenji F Tanaka
- Division of Brain Sciences, Institute for Advanced Medical Research, Keio University School of Medicine, Tokyo, Japan
| | - Norio Takata
- Division of Brain Sciences, Institute for Advanced Medical Research, Keio University School of Medicine, Tokyo, Japan
| |
Collapse
|
22
|
Seiler JPH, Rumpel S. Modeling fashion as an emergent collective behavior of bored individuals. Sci Rep 2023; 13:20480. [PMID: 37993553 PMCID: PMC10665449 DOI: 10.1038/s41598-023-47749-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2023] [Accepted: 11/17/2023] [Indexed: 11/24/2023] Open
Abstract
Boredom is an aversive mental state that is typically evoked by monotony and drives individuals to seek novel information. Despite this effect on individual behavior, the consequences of boredom for collective behavior remain elusive. Here, we introduce an agent-based model of collective fashion behavior in which simplified agents interact randomly and repeatedly choose alternatives from a circular space of color variants. Agents are endowed with a memory of past experiences and a boredom parameter, promoting avoidance of monotony. Simulating collective color trends with this model captures aspects of real trends observed in fashion magazines. We manipulate the two parameters and observe that the boredom parameter is essential for perpetuating fashion dynamics in our model. Furthermore, highly bored agents lead future population trends, when acting coherently or being highly popular. Taken together, our study illustrates that highly bored individuals can guide collective dynamics of a population to continuously explore different variants of behavior.
Collapse
Affiliation(s)
- Johannes P-H Seiler
- Institute of Physiology, Focus Program Translational Neurosciences, University Medical Center, Johannes Gutenberg University Mainz, Hanns-Dieter-Hüsch-Weg 19, 55131, Mainz, Germany.
| | - Simon Rumpel
- Institute of Physiology, Focus Program Translational Neurosciences, University Medical Center, Johannes Gutenberg University Mainz, Hanns-Dieter-Hüsch-Weg 19, 55131, Mainz, Germany.
| |
Collapse
|
23
|
Kalhan S, Garrido MI, Hester R, Redish AD. Reward prediction-errors weighted by cue salience produces addictive behaviours in simulations, with asymmetrical learning and steeper delay discounting. Neural Netw 2023; 168:631-650. [PMID: 37844522 DOI: 10.1016/j.neunet.2023.09.032] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2023] [Revised: 07/23/2023] [Accepted: 09/19/2023] [Indexed: 10/18/2023]
Abstract
Dysfunction in learning and motivational systems are thought to contribute to addictive behaviours. Previous models have suggested that dopaminergic roles in learning and motivation could produce addictive behaviours through pharmacological manipulations that provide excess dopaminergic signalling towards these learning and motivational systems. Redish (2004) suggested a role based on dopaminergic signals of value prediction error, while (Zhang et al., 2009) suggested a role based on dopaminergic signals of motivation. However, both models present significant limitations. They do not explain the reduced sensitivity to drug-related costs/negative consequences, the increased impulsivity generally found in people with a substance use disorder, craving behaviours, and non-pharmacological dependence, all of which are key hallmarks of addictive behaviours. Here, we propose a novel mathematical definition of salience, that combines aspects of dopamine's role in both learning and motivation within the reinforcement learning framework. Using a single parameter regime, we simulated addictive behaviours that the (Zhang et al., 2009; Redish, 2004) models also produce but we went further in simulating the downweighting of drug-related negative prediction-errors, steeper delay discounting of drug rewards, craving behaviours and aspects of behavioural/non-pharmacological addictions. The current salience model builds on our recently proposed conceptual theory that salience modulates internal representation updating and may contribute to addictive behaviours by producing misaligned internal representations (Kalhan et al., 2021). Critically, our current mathematical model of salience argues that the seemingly disparate learning and motivational aspects of dopaminergic functioning may interact through a salience mechanism that modulates internal representation updating.
Collapse
Affiliation(s)
- Shivam Kalhan
- University of Melbourne, School of Psychological Sciences, Melbourne, Victoria, Australia.
| | - Marta I Garrido
- University of Melbourne, School of Psychological Sciences, Melbourne, Victoria, Australia; Graeme Clark Institute for Biomedical Engineering, Melbourne, Victoria, Australia
| | - Robert Hester
- University of Melbourne, School of Psychological Sciences, Melbourne, Victoria, Australia
| | - A David Redish
- Department of Neuroscience, University of Minnesota, Minneapolis, MN 55455, USA
| |
Collapse
|
24
|
Konishi M, Igarashi KM, Miura K. Biologically plausible local synaptic learning rules robustly implement deep supervised learning. Front Neurosci 2023; 17:1160899. [PMID: 37886676 PMCID: PMC10598703 DOI: 10.3389/fnins.2023.1160899] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2023] [Accepted: 08/31/2023] [Indexed: 10/28/2023] Open
Abstract
In deep neural networks, representational learning in the middle layer is essential for achieving efficient learning. However, the currently prevailing backpropagation learning rules (BP) are not necessarily biologically plausible and cannot be implemented in the brain in their current form. Therefore, to elucidate the learning rules used by the brain, it is critical to establish biologically plausible learning rules for practical memory tasks. For example, learning rules that result in a learning performance worse than that of animals observed in experimental studies may not be computations used in real brains and should be ruled out. Using numerical simulations, we developed biologically plausible learning rules to solve a task that replicates a laboratory experiment where mice learned to predict the correct reward amount. Although the extreme learning machine (ELM) and weight perturbation (WP) learning rules performed worse than the mice, the feedback alignment (FA) rule achieved a performance equal to that of BP. To obtain a more biologically plausible model, we developed a variant of FA, FA_Ex-100%, which implements direct dopamine inputs that provide error signals locally in the layer of focus, as found in the mouse entorhinal cortex. The performance of FA_Ex-100% was comparable to that of conventional BP. Finally, we tested whether FA_Ex-100% was robust against rule perturbations and biologically inevitable noise. FA_Ex-100% worked even when subjected to perturbations, presumably because it could calibrate the correct prediction error (e.g., dopaminergic signals) in the next step as a teaching signal if the perturbation created a deviation. These results suggest that simplified and biologically plausible learning rules, such as FA_Ex-100%, can robustly facilitate deep supervised learning when the error signal, possibly conveyed by dopaminergic neurons, is accurate.
Collapse
Affiliation(s)
- Masataka Konishi
- Department of Biosciences, School of Biological and Environmental Sciences, Kwansei Gakuin University, Sanda, Hyogo, Japan
| | - Kei M. Igarashi
- Department of Anatomy and Neurobiology, School of Medicine, University of California, Irvine, Irvine, CA, United States
| | - Keiji Miura
- Department of Biosciences, School of Biological and Environmental Sciences, Kwansei Gakuin University, Sanda, Hyogo, Japan
| |
Collapse
|
25
|
Rios A, Nonomura S, Kato S, Yoshida J, Matsushita N, Nambu A, Takada M, Hira R, Kobayashi K, Sakai Y, Kimura M, Isomura Y. Reward expectation enhances action-related activity of nigral dopaminergic and two striatal output pathways. Commun Biol 2023; 6:914. [PMID: 37673949 PMCID: PMC10482957 DOI: 10.1038/s42003-023-05288-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2022] [Accepted: 08/25/2023] [Indexed: 09/08/2023] Open
Abstract
Neurons comprising nigrostriatal system play important roles in action selection. However, it remains unclear how this system integrates recent outcome information with current action (movement) and outcome (reward or no reward) information to achieve appropriate subsequent action. We examined how neuronal activity of substantia nigra pars compacta (SNc) and dorsal striatum reflects the level of reward expectation from recent outcomes in rats performing a reward-based choice task. Movement-related activity of direct and indirect pathway striatal projection neurons (dSPNs and iSPNs, respectively) were enhanced by reward expectation, similarly to the SNc dopaminergic neurons, in both medial and lateral nigrostriatal projections. Given the classical basal ganglia model wherein dopamine stimulates dSPNs and suppresses iSPNs through distinct dopamine receptors, dopamine might not be the primary driver of iSPN activity increasing following higher reward expectation. In contrast, outcome-related activity was affected by reward expectation in line with the classical model and reinforcement learning theory, suggesting purposive effects of reward expectation.
Collapse
Affiliation(s)
- Alain Rios
- Department of Physiology and Cell Biology, Graduate School of Medical and Dental Sciences, Tokyo Medical and Dental University (TMDU), Tokyo, 113-8510, Japan.
| | - Satoshi Nonomura
- Department of Physiology and Cell Biology, Graduate School of Medical and Dental Sciences, Tokyo Medical and Dental University (TMDU), Tokyo, 113-8510, Japan
- Center for the Evolutionary Origins of Human Behavior, Kyoto University, Aichi, 484-8506, Japan
| | - Shigeki Kato
- Department of Molecular Genetics, Institute of Biomedical Science, Fukushima Medical University, Fukushima, 960-1295, Japan
| | - Junichi Yoshida
- Dominick P. Purpura Department of Neuroscience, Albert Einstein College of Medicine, Bronx, NY, 10461, USA
| | - Natsuki Matsushita
- Division of Laboratory Animal Research, Aichi Medical University, Aichi, 480-1195, Japan
| | - Atsushi Nambu
- Division of System Neurophysiology, National Institute of Physiological Sciences and Department of Physiological Sciences, SOKENDAI, Aichi, 444-8585, Japan
| | - Masahiko Takada
- Center for the Evolutionary Origins of Human Behavior, Kyoto University, Aichi, 484-8506, Japan
| | - Riichiro Hira
- Department of Physiology and Cell Biology, Graduate School of Medical and Dental Sciences, Tokyo Medical and Dental University (TMDU), Tokyo, 113-8510, Japan
| | - Kazuto Kobayashi
- Department of Molecular Genetics, Institute of Biomedical Science, Fukushima Medical University, Fukushima, 960-1295, Japan
| | - Yutaka Sakai
- Brain Science Institute, Tamagawa University, Tokyo, 194-8610, Japan
| | - Minoru Kimura
- Brain Science Institute, Tamagawa University, Tokyo, 194-8610, Japan
| | - Yoshikazu Isomura
- Department of Physiology and Cell Biology, Graduate School of Medical and Dental Sciences, Tokyo Medical and Dental University (TMDU), Tokyo, 113-8510, Japan.
- Brain Science Institute, Tamagawa University, Tokyo, 194-8610, Japan.
| |
Collapse
|
26
|
Hennig JA, Romero Pinto SA, Yamaguchi T, Linderman SW, Uchida N, Gershman SJ. Emergence of belief-like representations through reinforcement learning. PLoS Comput Biol 2023; 19:e1011067. [PMID: 37695776 PMCID: PMC10513382 DOI: 10.1371/journal.pcbi.1011067] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2023] [Revised: 09/21/2023] [Accepted: 08/27/2023] [Indexed: 09/13/2023] Open
Abstract
To behave adaptively, animals must learn to predict future reward, or value. To do this, animals are thought to learn reward predictions using reinforcement learning. However, in contrast to classical models, animals must learn to estimate value using only incomplete state information. Previous work suggests that animals estimate value in partially observable tasks by first forming "beliefs"-optimal Bayesian estimates of the hidden states in the task. Although this is one way to solve the problem of partial observability, it is not the only way, nor is it the most computationally scalable solution in complex, real-world environments. Here we show that a recurrent neural network (RNN) can learn to estimate value directly from observations, generating reward prediction errors that resemble those observed experimentally, without any explicit objective of estimating beliefs. We integrate statistical, functional, and dynamical systems perspectives on beliefs to show that the RNN's learned representation encodes belief information, but only when the RNN's capacity is sufficiently large. These results illustrate how animals can estimate value in tasks without explicitly estimating beliefs, yielding a representation useful for systems with limited capacity.
Collapse
Affiliation(s)
- Jay A. Hennig
- Department of Psychology, Harvard University, Cambridge, Massachusetts, United States of America
- Center for Brain Science, Harvard University, Cambridge, Massachusetts, United States of America
| | - Sandra A. Romero Pinto
- Center for Brain Science, Harvard University, Cambridge, Massachusetts, United States of America
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, Massachusetts, United States of America
- Program in Speech and Hearing Bioscience and Technology, Harvard Medical School, Boston, Massachusetts, USA
| | - Takahiro Yamaguchi
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, Massachusetts, United States of America
- Future Research Department, Toyota Research Institute of North America, Toyota Motor North America, Ann Arbor, Michigan, United States of America
| | - Scott W. Linderman
- Wu Tsai Neurosciences Institute, Stanford University, Stanford, California, United States of America
- Department of Statistics, Stanford University, Stanford, California, United States of America
| | - Naoshige Uchida
- Center for Brain Science, Harvard University, Cambridge, Massachusetts, United States of America
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, Massachusetts, United States of America
| | - Samuel J. Gershman
- Department of Psychology, Harvard University, Cambridge, Massachusetts, United States of America
- Center for Brain Science, Harvard University, Cambridge, Massachusetts, United States of America
| |
Collapse
|
27
|
Takahashi YK, Zhang Z, Montesinos-Cartegena M, Kahnt T, Langdon AJ, Schoenbaum G. Expectancy-related changes in firing of dopamine neurons depend on hippocampus. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.07.19.549728. [PMID: 37781610 PMCID: PMC10541105 DOI: 10.1101/2023.07.19.549728] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/03/2023]
Abstract
The orbitofrontal cortex (OFC) and hippocampus (HC) are both implicated in forming the cognitive or task maps that support flexible behavior. Previously, we used the dopamine neurons as a sensor or tool to measure the functional effects of OFC lesions (Takahashi et al., 2011). We recorded midbrain dopamine neurons as rats performed an odor-based choice task, in which errors in the prediction of reward were induced by manipulating the number or timing of the expected rewards across blocks of trials. We found that OFC lesions ipsilateral to the recording electrodes caused prediction errors to be degraded consistent with a loss in the resolution of the task states, particularly under conditions where hidden information was critical to sharpening the predictions. Here we have repeated this experiment, along with computational modeling of the results, in rats with ipsilateral HC lesions. The results show HC also shapes the map of our task, however unlike OFC, which provides information local to the trial, the HC appears to be necessary for estimating the upper-level hidden states based on the information that is discontinuous or separated by longer timescales. The results contrast the respective roles of the OFC and HC in cognitive mapping and add to evidence that the dopamine neurons access a rich information set from distributed regions regarding the predictive structure of the environment, potentially enabling this powerful teaching signal to support complex learning and behavior.
Collapse
Affiliation(s)
- Yuji K Takahashi
- Intramural Research Program, National Institute on Drug Abuse, Baltimore, MD
| | - Zhewei Zhang
- Intramural Research Program, National Institute on Drug Abuse, Baltimore, MD
| | | | - Thorsten Kahnt
- Intramural Research Program, National Institute on Drug Abuse, Baltimore, MD
| | - Angela J Langdon
- Intramural Research Program, National Institute on Mental Health, Bethesda, MD
| | - Geoffrey Schoenbaum
- Intramural Research Program, National Institute on Drug Abuse, Baltimore, MD
| |
Collapse
|
28
|
LeDuke DO, Borio M, Miranda R, Tye KM. Anxiety and depression: A top-down, bottom-up model of circuit function. Ann N Y Acad Sci 2023; 1525:70-87. [PMID: 37129246 PMCID: PMC10695657 DOI: 10.1111/nyas.14997] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]
Abstract
A functional interplay of bottom-up and top-down processing allows an individual to appropriately respond to the dynamic environment around them. These processing modalities can be represented as attractor states using a dynamical systems model of the brain. The transition probability to move from one attractor state to another is dependent on the stability, depth, neuromodulatory tone, and tonic changes in plasticity. However, how does the relationship between these states change in disease states, such as anxiety or depression? We describe bottom-up and top-down processing from Marr's computational-algorithmic-implementation perspective to understand depressive and anxious disease states. We illustrate examples of bottom-up processing as basolateral amygdala signaling and projections and top-down processing as medial prefrontal cortex internal signaling and projections. Understanding these internal processing dynamics can help us better model the multifaceted elements of anxiety and depression.
Collapse
Affiliation(s)
- Deryn O. LeDuke
- Salk Institute for Biological Studies, La Jolla, California, USA
- Biomedical Sciences Graduate Program, University of California San Diego, La Jolla, California, USA
| | - Matilde Borio
- Salk Institute for Biological Studies, La Jolla, California, USA
| | - Raymundo Miranda
- Salk Institute for Biological Studies, La Jolla, California, USA
- Neurosciences Graduate Program, University of California San Diego, La Jolla, California, USA
| | - Kay M. Tye
- Salk Institute for Biological Studies, La Jolla, California, USA
- Howard Hughes Medical Institute, La Jolla, California, USA
- Kavli Institute for the Brain and Mind, La Jolla, California, USA
| |
Collapse
|
29
|
Bousseyrol E, Didienne S, Takillah S, Prevost-Solié C, Come M, Ahmed Yahia T, Mondoloni S, Vicq E, Tricoire L, Mourot A, Naudé J, Faure P. Dopaminergic and prefrontal dynamics co-determine mouse decisions in a spatial gambling task. Cell Rep 2023; 42:112523. [PMID: 37200189 DOI: 10.1016/j.celrep.2023.112523] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2022] [Revised: 01/28/2023] [Accepted: 05/02/2023] [Indexed: 05/20/2023] Open
Abstract
The neural mechanisms by which animals initiate goal-directed actions, choose between options, or explore opportunities remain unknown. Here, we develop a spatial gambling task in which mice, to obtain intracranial self-stimulation rewards, self-determine the initiation, direction, vigor, and pace of their actions based on their knowledge of the outcomes. Using electrophysiological recordings, pharmacology, and optogenetics, we identify a sequence of oscillations and firings in the ventral tegmental area (VTA), orbitofrontal cortex (OFC), and prefrontal cortex (PFC) that co-encodes and co-determines self-initiation and choices. This sequence appeared with learning as an uncued realignment of spontaneous dynamics. Interactions between the structures varied with the reward context, particularly the uncertainty associated with the different options. We suggest that self-generated choices arise from a distributed circuit based on an OFC-VTA core determining whether to wait for or initiate actions, while the PFC is specifically engaged by reward uncertainty in action selection and pace.
Collapse
Affiliation(s)
- Elise Bousseyrol
- Sorbonne Université, INSERM, CNRS, Neuroscience Paris Seine - Institut de Biologie Paris Seine (NPS - IBPS), 75005 Paris, France; Brain Plasticity Laboratory, CNRS, ESPCI Paris, PSL Research University, 75005 Paris, France
| | - Steve Didienne
- Sorbonne Université, INSERM, CNRS, Neuroscience Paris Seine - Institut de Biologie Paris Seine (NPS - IBPS), 75005 Paris, France; Brain Plasticity Laboratory, CNRS, ESPCI Paris, PSL Research University, 75005 Paris, France
| | - Samir Takillah
- Sorbonne Université, INSERM, CNRS, Neuroscience Paris Seine - Institut de Biologie Paris Seine (NPS - IBPS), 75005 Paris, France; Brain Plasticity Laboratory, CNRS, ESPCI Paris, PSL Research University, 75005 Paris, France
| | - Clement Prevost-Solié
- Sorbonne Université, INSERM, CNRS, Neuroscience Paris Seine - Institut de Biologie Paris Seine (NPS - IBPS), 75005 Paris, France; Brain Plasticity Laboratory, CNRS, ESPCI Paris, PSL Research University, 75005 Paris, France
| | - Maxime Come
- Sorbonne Université, INSERM, CNRS, Neuroscience Paris Seine - Institut de Biologie Paris Seine (NPS - IBPS), 75005 Paris, France; Brain Plasticity Laboratory, CNRS, ESPCI Paris, PSL Research University, 75005 Paris, France
| | - Tarek Ahmed Yahia
- Sorbonne Université, INSERM, CNRS, Neuroscience Paris Seine - Institut de Biologie Paris Seine (NPS - IBPS), 75005 Paris, France
| | - Sarah Mondoloni
- Sorbonne Université, INSERM, CNRS, Neuroscience Paris Seine - Institut de Biologie Paris Seine (NPS - IBPS), 75005 Paris, France
| | - Eléonore Vicq
- Sorbonne Université, INSERM, CNRS, Neuroscience Paris Seine - Institut de Biologie Paris Seine (NPS - IBPS), 75005 Paris, France
| | - Ludovic Tricoire
- Sorbonne Université, INSERM, CNRS, Neuroscience Paris Seine - Institut de Biologie Paris Seine (NPS - IBPS), 75005 Paris, France
| | - Alexandre Mourot
- Sorbonne Université, INSERM, CNRS, Neuroscience Paris Seine - Institut de Biologie Paris Seine (NPS - IBPS), 75005 Paris, France; Brain Plasticity Laboratory, CNRS, ESPCI Paris, PSL Research University, 75005 Paris, France
| | - Jérémie Naudé
- Sorbonne Université, INSERM, CNRS, Neuroscience Paris Seine - Institut de Biologie Paris Seine (NPS - IBPS), 75005 Paris, France; CNRS, Université de Montpellier, INSERM - Institut de Génomique Fonctionnelle, 34094 Montpellier, France.
| | - Philippe Faure
- Sorbonne Université, INSERM, CNRS, Neuroscience Paris Seine - Institut de Biologie Paris Seine (NPS - IBPS), 75005 Paris, France; Brain Plasticity Laboratory, CNRS, ESPCI Paris, PSL Research University, 75005 Paris, France.
| |
Collapse
|
30
|
Takahashi YK, Stalnaker TA, Mueller LE, Harootonian SK, Langdon AJ, Schoenbaum G. Dopaminergic prediction errors in the ventral tegmental area reflect a multithreaded predictive model. Nat Neurosci 2023; 26:830-839. [PMID: 37081296 PMCID: PMC10646487 DOI: 10.1038/s41593-023-01310-x] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2021] [Accepted: 03/16/2023] [Indexed: 04/22/2023]
Abstract
Dopamine neuron activity is tied to the prediction error in temporal difference reinforcement learning models. These models make significant simplifying assumptions, particularly with regard to the structure of the predictions fed into the dopamine neurons, which consist of a single chain of timepoint states. Although this predictive structure can explain error signals observed in many studies, it cannot cope with settings where subjects might infer multiple independent events and outcomes. In the present study, we recorded dopamine neurons in the ventral tegmental area in such a setting to test the validity of the single-stream assumption. Rats were trained in an odor-based choice task, in which the timing and identity of one of several rewards delivered in each trial changed across trial blocks. This design revealed an error signaling pattern that requires the dopamine neurons to access and update multiple independent predictive streams reflecting the subject's belief about timing and potentially unique identities of expected rewards.
Collapse
Affiliation(s)
- Yuji K Takahashi
- Intramural Research Program, National Institute on Drug Abuse, Baltimore, MD, USA.
| | - Thomas A Stalnaker
- Intramural Research Program, National Institute on Drug Abuse, Baltimore, MD, USA
| | - Lauren E Mueller
- Intramural Research Program, National Institute on Drug Abuse, Baltimore, MD, USA
| | | | - Angela J Langdon
- Intramural Research Program, National Institute of Mental Health, Bethesda, MD, USA.
| | - Geoffrey Schoenbaum
- Intramural Research Program, National Institute on Drug Abuse, Baltimore, MD, USA.
| |
Collapse
|
31
|
Alexander WH, Deraeve J, Vassena E. Dissociation and integration of outcome and state uncertainty signals in cognitive control. COGNITIVE, AFFECTIVE & BEHAVIORAL NEUROSCIENCE 2023:10.3758/s13415-023-01091-7. [PMID: 37058212 PMCID: PMC10390360 DOI: 10.3758/s13415-023-01091-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Accepted: 03/13/2023] [Indexed: 04/15/2023]
Abstract
Signals related to uncertainty are frequently observed in regions of the cognitive control network, including anterior cingulate/medial prefrontal cortex (ACC/mPFC), dorsolateral prefrontal cortex (dlPFC), and anterior insular cortex. Uncertainty generally refers to conditions in which decision variables may assume multiple possible values and can arise at multiple points in the perception-action cycle, including sensory input, inferred states of the environment, and the consequences of actions. These sources of uncertainty are frequently correlated: noisy input can lead to unreliable estimates of the state of the environment, with consequential influences on action selection. Given this correlation amongst various sources of uncertainty, dissociating the neural structures underlying their estimation presents an ongoing issue: a region associated with uncertainty related to outcomes may estimate outcome uncertainty itself, or it may reflect a cascade effect of state uncertainty on outcome estimates. In this study, we derive signals of state and outcome uncertainty from mathematical models of risk and observe regions in the cognitive control network whose activity is best explained by signals related to state uncertainty (anterior insula), outcome uncertainty (dlPFC), as well as regions that appear to integrate the two (ACC/mPFC).
Collapse
Affiliation(s)
- William H Alexander
- Center for Complex Systems & Brain Sciences, Florida Atlantic University, Boca Raton, FL, USA.
- Department of Psychology, Florida Atlantic University, Boca Raton, FL, USA.
- The Brain Institute, Florida Atlantic University, Boca Raton, FL, USA.
- Department of Experimental Psychology, Ghent University, Ghent, Belgium.
| | - James Deraeve
- Department of Experimental Psychology, Ghent University, Ghent, Belgium
| | - Eliana Vassena
- Experimental Psychopathology and Treatment, Behavioural Science Institute, Radboud University, Nijmegen, Netherlands
- Donders Institute for Brain, Cognition and Behaviour, Radboudumc, Nijmegen, Netherlands
| |
Collapse
|
32
|
Hennig JA, Pinto SAR, Yamaguchi T, Linderman SW, Uchida N, Gershman SJ. Emergence of belief-like representations through reinforcement learning. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.04.04.535512. [PMID: 37066383 PMCID: PMC10104054 DOI: 10.1101/2023.04.04.535512] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/18/2023]
Abstract
To behave adaptively, animals must learn to predict future reward, or value. To do this, animals are thought to learn reward predictions using reinforcement learning. However, in contrast to classical models, animals must learn to estimate value using only incomplete state information. Previous work suggests that animals estimate value in partially observable tasks by first forming "beliefs"-optimal Bayesian estimates of the hidden states in the task. Although this is one way to solve the problem of partial observability, it is not the only way, nor is it the most computationally scalable solution in complex, real-world environments. Here we show that a recurrent neural network (RNN) can learn to estimate value directly from observations, generating reward prediction errors that resemble those observed experimentally, without any explicit objective of estimating beliefs. We integrate statistical, functional, and dynamical systems perspectives on beliefs to show that the RNN's learned representation encodes belief information, but only when the RNN's capacity is sufficiently large. These results illustrate how animals can estimate value in tasks without explicitly estimating beliefs, yielding a representation useful for systems with limited capacity. Author Summary Natural environments are full of uncertainty. For example, just because my fridge had food in it yesterday does not mean it will have food today. Despite such uncertainty, animals can estimate which states and actions are the most valuable. Previous work suggests that animals estimate value using a brain area called the basal ganglia, using a process resembling a reinforcement learning algorithm called TD learning. However, traditional reinforcement learning algorithms cannot accurately estimate value in environments with state uncertainty (e.g., when my fridge's contents are unknown). One way around this problem is if agents form "beliefs," a probabilistic estimate of how likely each state is, given any observations so far. However, estimating beliefs is a demanding process that may not be possible for animals in more complex environments. Here we show that an artificial recurrent neural network (RNN) trained with TD learning can estimate value from observations, without explicitly estimating beliefs. The trained RNN's error signals resembled the neural activity of dopamine neurons measured during the same task. Importantly, the RNN's activity resembled beliefs, but only when the RNN had enough capacity. This work illustrates how animals could estimate value in uncertain environments without needing to first form beliefs, which may be useful in environments where computing the true beliefs is too costly.
Collapse
Affiliation(s)
- Jay A. Hennig
- Department of Psychology, Harvard University, Cambridge, MA, USA
- Center for Brain Science, Harvard University, Cambridge, MA, USA
| | - Sandra A. Romero Pinto
- Center for Brain Science, Harvard University, Cambridge, MA, USA
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, MA, USA
| | - Takahiro Yamaguchi
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, MA, USA
- Future Vehicle Research Department, Toyota Research Institute North America, Toyota Motor North America Inc., Ann Arbor, MI, USA
| | - Scott W. Linderman
- Wu Tsai Neurosciences Institute, Stanford University, Stanford, CA, USA
- Department of Statistics, Stanford University, Stanford, CA, USA
| | - Naoshige Uchida
- Center for Brain Science, Harvard University, Cambridge, MA, USA
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, MA, USA
| | - Samuel J. Gershman
- Department of Psychology, Harvard University, Cambridge, MA, USA
- Center for Brain Science, Harvard University, Cambridge, MA, USA
| |
Collapse
|
33
|
Wang Z, Lou S, Ma X, Guo H, Liu Y, Chen W, Lin D, Yang Y. Neural ensembles in the murine medial prefrontal cortex process distinct information during visual perceptual learning. BMC Biol 2023; 21:44. [PMID: 36829186 PMCID: PMC9960446 DOI: 10.1186/s12915-023-01529-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2022] [Accepted: 01/27/2023] [Indexed: 02/26/2023] Open
Abstract
BACKGROUND Perceptual learning refers to an augmentation of an organism's ability to respond to external stimuli, which has been described in most sensory modalities. Visual perceptual learning (VPL) is a manifestation of plasticity in visual information processing that occurs in the adult brain, and can be used to ameliorate the ability of patients with visual defects mainly based on an improvement of detection or discrimination of features in visual tasks. While some brain regions such as the primary visual cortex have been described to participate in VPL, the way more general high-level cognitive brain areas are involved in this process remains unclear. Here, we showed that the medial prefrontal cortex (mPFC) was essential for both the training and maintenance processes of VPL in mouse models. RESULTS We built a new VPL model in a custom-designed training chamber to enable the utilization of miniScopes when mice freely executed the VPL task. We found that pyramidal neurons in the mPFC participate in both the training process and maintenance of VPL. By recording the calcium activity of mPFC pyramidal neurons while mice freely executed the task, distinct ON and OFF neural ensembles tuned to different behaviors were identified, which might encode different cognitive information. Decoding analysis showed that mouse behaviors could be well predicted using the activity of each ON ensemble. Furthermore, VPL recruited more reward-related components in the mPFC. CONCLUSION We revealed the neural mechanism underlying vision improvement following VPL and identify distinct ON and OFF neural ensembles in the mPFC that tuned to different information during visual perceptual training. These results uncover an important role of the mPFC in VPL, with more reward-related components being also involved, and pave the way for future clarification of the reward signal coding rules in VPL.
Collapse
Affiliation(s)
- Zhenni Wang
- grid.59053.3a0000000121679639Division of Life Sciences and Medicine, Hefei National Research Center for Physical Sciences at the Microscale, University of Science and Technology of China, Hefei, 230026 China
| | - Shihao Lou
- grid.59053.3a0000000121679639Division of Life Sciences and Medicine, Hefei National Research Center for Physical Sciences at the Microscale, University of Science and Technology of China, Hefei, 230026 China
| | - Xiao Ma
- grid.59053.3a0000000121679639Division of Life Sciences and Medicine, Hefei National Research Center for Physical Sciences at the Microscale, University of Science and Technology of China, Hefei, 230026 China
| | - Hui Guo
- grid.59053.3a0000000121679639Division of Life Sciences and Medicine, Hefei National Research Center for Physical Sciences at the Microscale, University of Science and Technology of China, Hefei, 230026 China
| | - Yan Liu
- grid.59053.3a0000000121679639Division of Life Sciences and Medicine, Hefei National Research Center for Physical Sciences at the Microscale, University of Science and Technology of China, Hefei, 230026 China
| | - Wenjing Chen
- grid.59053.3a0000000121679639Division of Life Sciences and Medicine, Hefei National Research Center for Physical Sciences at the Microscale, University of Science and Technology of China, Hefei, 230026 China
| | - Dating Lin
- grid.420090.f0000 0004 0533 7147Intramural Research Program, National Institute On Drug Abuse, National Institutes of Health, Baltimore, MD 21224 USA
| | - Yupeng Yang
- Division of Life Sciences and Medicine, Hefei National Research Center for Physical Sciences at the Microscale, University of Science and Technology of China, Hefei, 230026, China.
| |
Collapse
|
34
|
Zaidel DW. The art of film: Perspective on neural clues to repeated attraction to movie watching. Neuropsychologia 2023; 180:108485. [PMID: 36680933 DOI: 10.1016/j.neuropsychologia.2023.108485] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2022] [Revised: 01/04/2023] [Accepted: 01/14/2023] [Indexed: 01/20/2023]
Abstract
This article about possible neural underpinning of repeated attraction to watching movies is dedicated to the memory of Prof. Eran Zaidel, who made outstanding contributions to neuroscience (and loved watching movies). The film is an art form crafted by multiple artists from diverse fields, contributing specialized skills, talents, and creativity to the final product. Attention-attraction to all artworks has deep biological roots. Movies have been attracting audiences repeatedly ever since they were introduced over 100 years ago. Although countless studies analyzed the nature of the art, the neural underpinning of repeated attraction to viewing movies has been understudied. Here, clues gleaned from non-film findings are proposed. The perspective suggests that functions of the mesolimbic "reward pathway" associated with pleasure and joy, the brain regions responding to facial beauty, to pictorial art aesthetics, and to music listening with increased dopamine levels are all recruited in the repeated attraction.
Collapse
Affiliation(s)
- D W Zaidel
- Dept. of Psychology, Behavioral Neuroscience, University of California, Los Angeles (UCLA), Los Angeles, CA, USA.
| |
Collapse
|
35
|
Pusch R, Clark W, Rose J, Güntürkün O. Visual categories and concepts in the avian brain. Anim Cogn 2023; 26:153-173. [PMID: 36352174 PMCID: PMC9877096 DOI: 10.1007/s10071-022-01711-8] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2022] [Revised: 10/19/2022] [Accepted: 10/25/2022] [Indexed: 11/11/2022]
Abstract
Birds are excellent model organisms to study perceptual categorization and concept formation. The renewed focus on avian neuroscience has sparked an explosion of new data in the field. At the same time, our understanding of sensory and particularly visual structures in the avian brain has shifted fundamentally. These recent discoveries have revealed how categorization is mediated in the avian brain and has generated a theoretical framework that goes beyond the realm of birds. We review the contribution of avian categorization research-at the methodical, behavioral, and neurobiological levels. To this end, we first introduce avian categorization from a behavioral perspective and the common elements model of categorization. Second, we describe the functional and structural organization of the avian visual system, followed by an overview of recent anatomical discoveries and the new perspective on the avian 'visual cortex'. Third, we focus on the neurocomputational basis of perceptual categorization in the bird's visual system. Fourth, an overview of the avian prefrontal cortex and the prefrontal contribution to perceptual categorization is provided. The fifth section outlines how asymmetries of the visual system contribute to categorization. Finally, we present a mechanistic view of the neural principles of avian visual categorization and its putative extension to concept learning.
Collapse
Affiliation(s)
- Roland Pusch
- Biopsychology, Faculty of Psychology, Ruhr University Bochum, 44780, Bochum, Germany
| | - William Clark
- Neural Basis of Learning, Faculty of Psychology, Ruhr University Bochum, 44780, Bochum, Germany
| | - Jonas Rose
- Neural Basis of Learning, Faculty of Psychology, Ruhr University Bochum, 44780, Bochum, Germany
| | - Onur Güntürkün
- Biopsychology, Faculty of Psychology, Ruhr University Bochum, 44780, Bochum, Germany.
| |
Collapse
|
36
|
Eckstein MK, Master SL, Xia L, Dahl RE, Wilbrecht L, Collins AGE. The interpretation of computational model parameters depends on the context. eLife 2022; 11:e75474. [PMID: 36331872 PMCID: PMC9635876 DOI: 10.7554/elife.75474] [Citation(s) in RCA: 31] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2021] [Accepted: 09/09/2022] [Indexed: 11/06/2022] Open
Abstract
Reinforcement Learning (RL) models have revolutionized the cognitive and brain sciences, promising to explain behavior from simple conditioning to complex problem solving, to shed light on developmental and individual differences, and to anchor cognitive processes in specific brain mechanisms. However, the RL literature increasingly reveals contradictory results, which might cast doubt on these claims. We hypothesized that many contradictions arise from two commonly-held assumptions about computational model parameters that are actually often invalid: That parameters generalize between contexts (e.g. tasks, models) and that they capture interpretable (i.e. unique, distinctive) neurocognitive processes. To test this, we asked 291 participants aged 8-30 years to complete three learning tasks in one experimental session, and fitted RL models to each. We found that some parameters (exploration / decision noise) showed significant generalization: they followed similar developmental trajectories, and were reciprocally predictive between tasks. Still, generalization was significantly below the methodological ceiling. Furthermore, other parameters (learning rates, forgetting) did not show evidence of generalization, and sometimes even opposite developmental trajectories. Interpretability was low for all parameters. We conclude that the systematic study of context factors (e.g. reward stochasticity; task volatility) will be necessary to enhance the generalizability and interpretability of computational cognitive models.
Collapse
Affiliation(s)
| | - Sarah L Master
- Department of Psychology, University of California, BerkeleyBerkeleyUnited States
- Department of Psychology, New York UniversityNew YorkUnited States
| | - Liyu Xia
- Department of Psychology, University of California, BerkeleyBerkeleyUnited States
- Department of Mathematics, University of California, BerkeleyBerkeleyUnited States
| | - Ronald E Dahl
- Institute of Human Development, University of California, BerkeleyBerkeleyUnited States
| | - Linda Wilbrecht
- Department of Psychology, University of California, BerkeleyBerkeleyUnited States
- Helen Wills Neuroscience Institute, University of California, BerkeleyBerkeleyUnited States
| | - Anne GE Collins
- Department of Psychology, University of California, BerkeleyBerkeleyUnited States
- Helen Wills Neuroscience Institute, University of California, BerkeleyBerkeleyUnited States
| |
Collapse
|
37
|
Mitani K, Kawabata M, Isomura Y, Sakai Y. Automated and parallelized spike collision tests to identify spike signal projections. iScience 2022; 25:105071. [PMID: 36157577 PMCID: PMC9490030 DOI: 10.1016/j.isci.2022.105071] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2022] [Revised: 08/09/2022] [Accepted: 08/30/2022] [Indexed: 10/28/2022] Open
Abstract
The spike collision test is a highly reliable technique to identify the axonal projection of a neuron recorded electrophysiologically for investigating functional spike information among brain areas. It is potentially applicable to more neuronal projections by combining multi-channel recording with optogenetic stimulation. Yet, it remains inefficient and laborious because an experimenter must visually select spikes in every channel and manually repeat spike collision tests for each neuron serially. Here, we automated spike collision tests for all channels in parallel (Multi-Linc analysis) in a multi-channel real-time processing system. The rat cortical neurons identified with this technique displayed physiological spike features consistent with excitatory projection neurons. Their antidromic spikes were similar in shape but slightly larger in amplitude compared with spontaneous spikes. In addition, we demonstrated simultaneous identification of reciprocal or bifurcating projections among cortical areas. Thus, our Multi-Linc analysis will be a powerful approach to elucidate interareal spike communication.
Collapse
Affiliation(s)
- Keita Mitani
- Brain Science Institute, Tamagawa University, Machida, Tokyo, Japan.,Department of Physiology and Cell Biology, Graduate School of Medical and Dental Sciences, Tokyo Medical and Dental University, Tokyo, Japan
| | - Masanori Kawabata
- Brain Science Institute, Tamagawa University, Machida, Tokyo, Japan.,Department of Physiology and Cell Biology, Graduate School of Medical and Dental Sciences, Tokyo Medical and Dental University, Tokyo, Japan
| | - Yoshikazu Isomura
- Brain Science Institute, Tamagawa University, Machida, Tokyo, Japan.,Department of Physiology and Cell Biology, Graduate School of Medical and Dental Sciences, Tokyo Medical and Dental University, Tokyo, Japan
| | - Yutaka Sakai
- Brain Science Institute, Tamagawa University, Machida, Tokyo, Japan
| |
Collapse
|
38
|
Lin WC, Liu C, Kosillo P, Tai LH, Galarce E, Bateup HS, Lammel S, Wilbrecht L. Transient food insecurity during the juvenile-adolescent period affects adult weight, cognitive flexibility, and dopamine neurobiology. Curr Biol 2022; 32:3690-3703.e5. [PMID: 35863352 PMCID: PMC10519557 DOI: 10.1016/j.cub.2022.06.089] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2021] [Revised: 04/01/2022] [Accepted: 06/29/2022] [Indexed: 10/17/2022]
Abstract
A major challenge for neuroscience, public health, and evolutionary biology is to understand the effects of scarcity and uncertainty on the developing brain. Currently, a significant fraction of children and adolescents worldwide experience insecure access to food. The goal of our work was to test in mice whether the transient experience of insecure versus secure access to food during the juvenile-adolescent period produced lasting differences in learning, decision-making, and the dopamine system in adulthood. We manipulated feeding schedules in mice from postnatal day (P)21 to P40 as food insecure or ad libitum and found that when tested in adulthood (after P60), males with different developmental feeding history showed significant differences in multiple metrics of cognitive flexibility in learning and decision-making. Adult females with different developmental feeding history showed no differences in cognitive flexibility but did show significant differences in adult weight. We next applied reinforcement learning models to these behavioral data. The best fit models suggested that in males, developmental feeding history altered how mice updated their behavior after negative outcomes. This effect was sensitive to task context and reward contingencies. Consistent with these results, in males, we found that the two feeding history groups showed significant differences in the AMPAR/NMDAR ratio of excitatory synapses on nucleus-accumbens-projecting midbrain dopamine neurons and evoked dopamine release in dorsal striatal targets. Together, these data show in a rodent model that transient differences in feeding history in the juvenile-adolescent period can have significant impacts on adult weight, learning, decision-making, and dopamine neurobiology.
Collapse
Affiliation(s)
- Wan Chen Lin
- Helen Wills Neuroscience Institute, University of California Berkeley, Berkeley, CA 94720, USA
| | - Christine Liu
- Helen Wills Neuroscience Institute, University of California Berkeley, Berkeley, CA 94720, USA
| | - Polina Kosillo
- Department of Molecular and Cellular Biology, University of California Berkeley, Berkeley, CA 94720, USA
| | - Lung-Hao Tai
- Helen Wills Neuroscience Institute, University of California Berkeley, Berkeley, CA 94720, USA
| | - Ezequiel Galarce
- Robert Wood Johnson Foundation Health and Society Scholar, University of California Berkeley, Berkeley, CA 94720, USA
| | - Helen S Bateup
- Helen Wills Neuroscience Institute, University of California Berkeley, Berkeley, CA 94720, USA; Department of Molecular and Cellular Biology, University of California Berkeley, Berkeley, CA 94720, USA; Chan Zuckerberg Biohub, San Francisco, CA 94158, USA
| | - Stephan Lammel
- Helen Wills Neuroscience Institute, University of California Berkeley, Berkeley, CA 94720, USA; Department of Molecular and Cellular Biology, University of California Berkeley, Berkeley, CA 94720, USA
| | - Linda Wilbrecht
- Helen Wills Neuroscience Institute, University of California Berkeley, Berkeley, CA 94720, USA; Department of Psychology, University of California Berkeley, Berkeley, CA 94720, USA.
| |
Collapse
|
39
|
Nour MM, Liu Y, Dolan RJ. Functional neuroimaging in psychiatry and the case for failing better. Neuron 2022; 110:2524-2544. [PMID: 35981525 DOI: 10.1016/j.neuron.2022.07.005] [Citation(s) in RCA: 37] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2022] [Revised: 06/06/2022] [Accepted: 07/08/2022] [Indexed: 12/27/2022]
Abstract
Psychiatric disorders encompass complex aberrations of cognition and affect and are among the most debilitating and poorly understood of any medical condition. Current treatments rely primarily on interventions that target brain function (drugs) or learning processes (psychotherapy). A mechanistic understanding of how these interventions mediate their therapeutic effects remains elusive. From the early 1990s, non-invasive functional neuroimaging, coupled with parallel developments in the cognitive neurosciences, seemed to signal a new era of neurobiologically grounded diagnosis and treatment in psychiatry. Yet, despite three decades of intense neuroimaging research, we still lack a neurobiological account for any psychiatric condition. Likewise, functional neuroimaging plays no role in clinical decision making. Here, we offer a critical commentary on this impasse and suggest how the field might fare better and deliver impactful neurobiological insights.
Collapse
Affiliation(s)
- Matthew M Nour
- Max Planck University College London Centre for Computational Psychiatry and Ageing Research, London WC1B 5EH, UK; Wellcome Trust Centre for Human Neuroimaging, University College London, London WC1N 3AR, UK; Department of Psychiatry, University of Oxford, Oxford OX3 7JX, UK.
| | - Yunzhe Liu
- Max Planck University College London Centre for Computational Psychiatry and Ageing Research, London WC1B 5EH, UK; State Key Laboratory of Cognitive Neuroscience and Learning, IDG/McGovern Institute for Brain Research, Beijing Normal University, Beijing 100875, China; Chinese Institute for Brain Research, Beijing 102206, China
| | - Raymond J Dolan
- Max Planck University College London Centre for Computational Psychiatry and Ageing Research, London WC1B 5EH, UK; Wellcome Trust Centre for Human Neuroimaging, University College London, London WC1N 3AR, UK; State Key Laboratory of Cognitive Neuroscience and Learning, IDG/McGovern Institute for Brain Research, Beijing Normal University, Beijing 100875, China.
| |
Collapse
|
40
|
Margolis AE, Liu R, Conceição VA, Ramphal B, Pagliaccio D, DeSerisy ML, Koe E, Selmanovic E, Raudales A, Emanet N, Quinn AE, Beebe B, Pearson BL, Herbstman JB, Rauh VA, Fifer WP, Fox NA, Champagne FA. Convergent neural correlates of prenatal exposure to air pollution and behavioral phenotypes of risk for internalizing and externalizing problems: Potential biological and cognitive pathways. Neurosci Biobehav Rev 2022; 137:104645. [PMID: 35367513 DOI: 10.1016/j.neubiorev.2022.104645] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2021] [Revised: 03/20/2022] [Accepted: 03/28/2022] [Indexed: 02/04/2023]
Abstract
Humans are ubiquitously exposed to neurotoxicants in air pollution, causing increased risk for psychiatric outcomes. Effects of prenatal exposure to air pollution on early emerging behavioral phenotypes that increase risk of psychopathology remain understudied. We review animal models that represent analogues of human behavioral phenotypes that are risk markers for internalizing and externalizing problems (behavioral inhibition, behavioral exuberance, irritability), and identify commonalities among the neural mechanisms underlying these behavioral phenotypes and the neural targets of three types of air pollutants (polycyclic aromatic hydrocarbons, traffic-related air pollutants, fine particulate matter < 2.5 µm). We conclude that prenatal exposure to air pollutants increases risk for behavioral inhibition and irritability through distinct mechanisms, including altered dopaminergic signaling and hippocampal morphology, neuroinflammation, and decreased brain-derived neurotrophic factor expression. Future studies should investigate these effects in human longitudinal studies incorporating complex exposure measurement methods, neuroimaging, and behavioral characterization of temperament phenotypes and neurocognitive processing to facilitate efforts aimed at improving long-lasting developmental benefits for children, particularly those living in areas with high levels of exposure.
Collapse
Affiliation(s)
- Amy E Margolis
- Division of Child and Adolescent Psychiatry, New York State Psychiatric Institute, New York, NY, USA; Department of Psychiatry, Vagelos College of Physicians and Surgeons, Columbia University, New York, NY, USA.
| | - Ran Liu
- Division of Child and Adolescent Psychiatry, New York State Psychiatric Institute, New York, NY, USA; Department of Psychiatry, Vagelos College of Physicians and Surgeons, Columbia University, New York, NY, USA
| | - Vasco A Conceição
- Instituto de Medicina Molecular João Lobo Antunes, Faculdade de Medicina, Universidade de Lisboa, Lisboa, Portugal
| | - Bruce Ramphal
- Division of Child and Adolescent Psychiatry, New York State Psychiatric Institute, New York, NY, USA; Department of Psychiatry, Vagelos College of Physicians and Surgeons, Columbia University, New York, NY, USA
| | - David Pagliaccio
- Division of Child and Adolescent Psychiatry, New York State Psychiatric Institute, New York, NY, USA; Department of Psychiatry, Vagelos College of Physicians and Surgeons, Columbia University, New York, NY, USA
| | - Mariah L DeSerisy
- Division of Child and Adolescent Psychiatry, New York State Psychiatric Institute, New York, NY, USA; Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, NY, USA
| | - Emily Koe
- Division of Child and Adolescent Psychiatry, New York State Psychiatric Institute, New York, NY, USA; Department of Psychiatry, Vagelos College of Physicians and Surgeons, Columbia University, New York, NY, USA
| | - Ena Selmanovic
- Division of Child and Adolescent Psychiatry, New York State Psychiatric Institute, New York, NY, USA; Department of Psychiatry, Vagelos College of Physicians and Surgeons, Columbia University, New York, NY, USA
| | - Amarelis Raudales
- Division of Child and Adolescent Psychiatry, New York State Psychiatric Institute, New York, NY, USA; Department of Psychiatry, Vagelos College of Physicians and Surgeons, Columbia University, New York, NY, USA
| | - Nur Emanet
- Division of Child and Adolescent Psychiatry, New York State Psychiatric Institute, New York, NY, USA; Department of Psychiatry, Vagelos College of Physicians and Surgeons, Columbia University, New York, NY, USA
| | - Aurabelle E Quinn
- Division of Child and Adolescent Psychiatry, New York State Psychiatric Institute, New York, NY, USA
| | - Beatrice Beebe
- Division of Child and Adolescent Psychiatry, New York State Psychiatric Institute, New York, NY, USA; Department of Psychiatry, Vagelos College of Physicians and Surgeons, Columbia University, New York, NY, USA
| | - Brandon L Pearson
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, NY, USA
| | - Julie B Herbstman
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, NY, USA; Columbia Center for Children's Environmental Health, Mailman School of Public Health, Columbia University, New York, NY, USA
| | - Virginia A Rauh
- Columbia Center for Children's Environmental Health, Mailman School of Public Health, Columbia University, New York, NY, USA; Heilbrunn Department of Population & Family Health, Mailman School of Public Health, Columbia University, New York, NY, USA
| | - William P Fifer
- Department of Psychiatry, Vagelos College of Physicians and Surgeons, Columbia University, New York, NY, USA; Department of Pediatrics, Columbia University Medical Center, New York, NY, USA; Division of Developmental Neuroscience, New York State Psychiatric Institute, New York, NY, USA
| | - Nathan A Fox
- Neuroscience and Cognitive Science Program, University of Maryland, College Park, MD, USA; Department of Human Development and Quantitative Methodology, University of Maryland, College Park, MD, USA
| | | |
Collapse
|
41
|
The role of state uncertainty in the dynamics of dopamine. Curr Biol 2022; 32:1077-1087.e9. [PMID: 35114098 PMCID: PMC8930519 DOI: 10.1016/j.cub.2022.01.025] [Citation(s) in RCA: 37] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2021] [Revised: 11/22/2021] [Accepted: 01/10/2022] [Indexed: 11/22/2022]
Abstract
Reinforcement learning models of the basal ganglia map the phasic dopamine signal to reward prediction errors (RPEs). Conventional models assert that, when a stimulus predicts a reward with fixed delay, dopamine activity during the delay should converge to baseline through learning. However, recent studies have found that dopamine ramps up before reward in certain conditions even after learning, thus challenging the conventional models. In this work, we show that sensory feedback causes an unbiased learner to produce RPE ramps. Our model predicts that when feedback gradually decreases during a trial, dopamine activity should resemble a "bump," whose ramp-up phase should, furthermore, be greater than that of conditions where the feedback stays high. We trained mice on a virtual navigation task with varying brightness, and both predictions were empirically observed. In sum, our theoretical and experimental results reconcile the seemingly conflicting data on dopamine behaviors under the RPE hypothesis.
Collapse
|
42
|
Skirzewski M, Molotchnikoff S, Hernandez LF, Maya-Vetencourt JF. Multisensory Integration: Is Medial Prefrontal Cortex Signaling Relevant for the Treatment of Higher-Order Visual Dysfunctions? Front Mol Neurosci 2022; 14:806376. [PMID: 35110996 PMCID: PMC8801884 DOI: 10.3389/fnmol.2021.806376] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2021] [Accepted: 12/17/2021] [Indexed: 11/29/2022] Open
Abstract
In the mammalian brain, information processing in sensory modalities and global mechanisms of multisensory integration facilitate perception. Emerging experimental evidence suggests that the contribution of multisensory integration to sensory perception is far more complex than previously expected. Here we revise how associative areas such as the prefrontal cortex, which receive and integrate inputs from diverse sensory modalities, can affect information processing in unisensory systems via processes of down-stream signaling. We focus our attention on the influence of the medial prefrontal cortex on the processing of information in the visual system and whether this phenomenon can be clinically used to treat higher-order visual dysfunctions. We propose that non-invasive and multisensory stimulation strategies such as environmental enrichment and/or attention-related tasks could be of clinical relevance to fight cerebral visual impairment.
Collapse
Affiliation(s)
- Miguel Skirzewski
- Rodent Cognition Research and Innovation Core, University of Western Ontario, London, ON, Canada
| | - Stéphane Molotchnikoff
- Département de Sciences Biologiques, Université de Montréal, Montreal, QC, Canada
- Département de Génie Electrique et Génie Informatique, Université de Sherbrooke, Sherbrooke, QC, Canada
| | - Luis F. Hernandez
- Knoebel Institute for Healthy Aging, University of Denver, Denver, CO, United States
| | - José Fernando Maya-Vetencourt
- Department of Biology, University of Pisa, Pisa, Italy
- Centre for Synaptic Neuroscience, Istituto Italiano di Tecnologia (IIT), Genova, Italy
- *Correspondence: José Fernando Maya-Vetencourt
| |
Collapse
|
43
|
Cools R, Arnsten AFT. Neuromodulation of prefrontal cortex cognitive function in primates: the powerful roles of monoamines and acetylcholine. Neuropsychopharmacology 2022; 47:309-328. [PMID: 34312496 PMCID: PMC8617291 DOI: 10.1038/s41386-021-01100-8] [Citation(s) in RCA: 94] [Impact Index Per Article: 31.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/14/2021] [Revised: 07/06/2021] [Accepted: 07/06/2021] [Indexed: 02/07/2023]
Abstract
The primate prefrontal cortex (PFC) subserves our highest order cognitive operations, and yet is tremendously dependent on a precise neurochemical environment for proper functioning. Depletion of noradrenaline and dopamine, or of acetylcholine from the dorsolateral PFC (dlPFC), is as devastating as removing the cortex itself, and serotonergic influences are also critical to proper functioning of the orbital and medial PFC. Most neuromodulators have a narrow inverted U dose response, which coordinates arousal state with cognitive state, and contributes to cognitive deficits with fatigue or uncontrollable stress. Studies in monkeys have revealed the molecular signaling mechanisms that govern the generation and modulation of mental representations by the dlPFC, allowing dynamic regulation of network strength, a process that requires tight regulation to prevent toxic actions, e.g., as occurs with advanced age. Brain imaging studies in humans have observed drug and genotype influences on a range of cognitive tasks and on PFC circuit functional connectivity, e.g., showing that catecholamines stabilize representations in a baseline-dependent manner. Research in monkeys has already led to new treatments for cognitive disorders in humans, encouraging future research in this important field.
Collapse
Affiliation(s)
- Roshan Cools
- Department of Psychiatry, Radboud University Medical Center, Nijmegen, the Netherlands
| | - Amy F T Arnsten
- Department of Neuroscience, Yale University School of Medicine, New Haven, CT, USA.
| |
Collapse
|
44
|
Kenwood MM, Kalin NH, Barbas H. The prefrontal cortex, pathological anxiety, and anxiety disorders. Neuropsychopharmacology 2022; 47:260-275. [PMID: 34400783 PMCID: PMC8617307 DOI: 10.1038/s41386-021-01109-z] [Citation(s) in RCA: 139] [Impact Index Per Article: 46.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/08/2021] [Revised: 07/06/2021] [Accepted: 07/08/2021] [Indexed: 02/07/2023]
Abstract
Anxiety is experienced in response to threats that are distal or uncertain, involving changes in one's subjective state, autonomic responses, and behavior. Defensive and physiologic responses to threats that involve the amygdala and brainstem are conserved across species. While anxiety responses typically serve an adaptive purpose, when excessive, unregulated, and generalized, they can become maladaptive, leading to distress and avoidance of potentially threatening situations. In primates, anxiety can be regulated by the prefrontal cortex (PFC), which has expanded in evolution. This prefrontal expansion is thought to underlie primates' increased capacity to engage high-level regulatory strategies aimed at coping with and modifying the experience of anxiety. The specialized primate lateral, medial, and orbital PFC sectors are connected with association and limbic cortices, the latter of which are connected with the amygdala and brainstem autonomic structures that underlie emotional and physiological arousal. PFC pathways that interface with distinct inhibitory systems within the cortex, the amygdala, or the thalamus can regulate responses by modulating neuronal output. Within the PFC, pathways connecting cortical regions are poised to reduce noise and enhance signals for cognitive operations that regulate anxiety processing and autonomic drive. Specialized PFC pathways to the inhibitory thalamic reticular nucleus suggest a mechanism to allow passage of relevant signals from thalamus to cortex, and in the amygdala to modulate the output to autonomic structures. Disruption of specific nodes within the PFC that interface with inhibitory systems can affect the negative bias, failure to regulate autonomic arousal, and avoidance that characterize anxiety disorders.
Collapse
Affiliation(s)
- Margaux M Kenwood
- Department of Psychiatry, University of Wisconsin School of Medicine and Public Health, Madison, WI, USA
- Neuroscience Training Program at University of Wisconsin-Madison, Madison, USA
| | - Ned H Kalin
- Department of Psychiatry, University of Wisconsin School of Medicine and Public Health, Madison, WI, USA
- Neuroscience Training Program at University of Wisconsin-Madison, Madison, USA
- Wisconsin National Primate Center, Madison, WI, USA
| | - Helen Barbas
- Neural Systems Laboratory, Department of Health Sciences, Boston University, Boston, MA, USA.
- Department of Anatomy and Neurobiology, Boston University School of Medicine, Boston, MA, USA.
| |
Collapse
|
45
|
Averbeck B, O'Doherty JP. Reinforcement-learning in fronto-striatal circuits. Neuropsychopharmacology 2022; 47:147-162. [PMID: 34354249 PMCID: PMC8616931 DOI: 10.1038/s41386-021-01108-0] [Citation(s) in RCA: 60] [Impact Index Per Article: 20.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/04/2021] [Revised: 07/06/2021] [Accepted: 07/09/2021] [Indexed: 01/03/2023]
Abstract
We review the current state of knowledge on the computational and neural mechanisms of reinforcement-learning with a particular focus on fronto-striatal circuits. We divide the literature in this area into five broad research themes: the target of the learning-whether it be learning about the value of stimuli or about the value of actions; the nature and complexity of the algorithm used to drive the learning and inference process; how learned values get converted into choices and associated actions; the nature of state representations, and of other cognitive machinery that support the implementation of various reinforcement-learning operations. An emerging fifth area focuses on how the brain allocates or arbitrates control over different reinforcement-learning sub-systems or "experts". We will outline what is known about the role of the prefrontal cortex and striatum in implementing each of these functions. We then conclude by arguing that it will be necessary to build bridges from algorithmic level descriptions of computational reinforcement-learning to implementational level models to better understand how reinforcement-learning emerges from multiple distributed neural networks in the brain.
Collapse
Affiliation(s)
| | - John P O'Doherty
- Division of Humanities and Social Sciences, California Institute of Technology, Pasadena, CA, USA.
| |
Collapse
|
46
|
VTA dopamine neuron activity encodes social interaction and promotes reinforcement learning through social prediction error. Nat Neurosci 2021; 25:86-97. [PMID: 34857949 PMCID: PMC7612196 DOI: 10.1038/s41593-021-00972-9] [Citation(s) in RCA: 82] [Impact Index Per Article: 20.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2020] [Accepted: 10/29/2021] [Indexed: 11/13/2022]
Abstract
Social interactions are motivated behaviors that in many species facilitate learning. However, how the brain encodes the reinforcing properties of social interactions remains elusive. Here, using in vivo recording in freely moving mice, we show that dopamine (DA) neurons of the ventral tegmental area (VTA) increase their activity during interactions with an unfamiliar conspecific and display heterogeneous responses. Using a social instrumental task (SIT), we then show that VTA DA neuron activity encodes social prediction error and drives social reinforcement learning. Thus, our findings suggest that VTA DA neurons are a neural substrate for a social learning signal that drives motivated behavior.
Collapse
|
47
|
Kalbe F, Schwabe L. Prediction Errors for Aversive Events Shape Long-Term Memory Formation through a Distinct Neural Mechanism. Cereb Cortex 2021; 32:3081-3097. [PMID: 34849622 DOI: 10.1093/cercor/bhab402] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2021] [Revised: 09/09/2021] [Accepted: 10/12/2021] [Indexed: 11/13/2022] Open
Abstract
Prediction errors (PEs) have been known for decades to guide associative learning, but their role in episodic memory formation has been discovered only recently. To identify the neural mechanisms underlying the impact of aversive PEs on long-term memory formation, we used functional magnetic resonance imaging, while participants saw a series of unique stimuli and estimated the probability that an aversive shock would follow. Our behavioral data showed that negative PEs (i.e., omission of an expected outcome) were associated with superior recognition of the predictive stimuli, whereas positive PEs (i.e., presentation of an unexpected outcome) impaired subsequent memory. While medial temporal lobe (MTL) activity during stimulus encoding was overall associated with enhanced memory, memory-enhancing effects of negative PEs were linked to even decreased MTL activation. Additional large-scale network analyses showed PE-related increases in crosstalk between the "salience network" and a frontoparietal network commonly implicated in memory formation for expectancy-congruent events. These effects could not be explained by mere changes in physiological arousal or the prediction itself. Our results suggest that the superior memory for events associated with negative aversive PEs is driven by a potentially distinct neural mechanism that might serve to set these memories apart from those with expected outcomes.
Collapse
Affiliation(s)
- Felix Kalbe
- Department of Cognitive Psychology, Institute of Psychology, Universität Hamburg, Hamburg 20146, Germany
| | - Lars Schwabe
- Department of Cognitive Psychology, Institute of Psychology, Universität Hamburg, Hamburg 20146, Germany
| |
Collapse
|
48
|
McDougle SD, Ballard IC, Baribault B, Bishop SJ, Collins AGE. Executive Function Assigns Value to Novel Goal-Congruent Outcomes. Cereb Cortex 2021; 32:231-247. [PMID: 34231854 PMCID: PMC8634563 DOI: 10.1093/cercor/bhab205] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2021] [Revised: 05/10/2021] [Accepted: 06/04/2021] [Indexed: 11/14/2022] Open
Abstract
People often learn from the outcomes of their actions, even when these outcomes do not involve material rewards or punishments. How does our brain provide this flexibility? We combined behavior, computational modeling, and functional neuroimaging to probe whether learning from abstract novel outcomes harnesses the same circuitry that supports learning from familiar secondary reinforcers. Behavior and neuroimaging revealed that novel images can act as a substitute for rewards during instrumental learning, producing reliable reward-like signals in dopaminergic circuits. Moreover, we found evidence that prefrontal correlates of executive control may play a role in shaping flexible responses in reward circuits. These results suggest that learning from novel outcomes is supported by an interplay between high-level representations in prefrontal cortex and low-level responses in subcortical reward circuits. This interaction may allow for human reinforcement learning over arbitrarily abstract reward functions.
Collapse
Affiliation(s)
| | - Ian C Ballard
- Helen Wills Neuroscience Institute, University of California, Berkeley, CA 94720, USA
| | - Beth Baribault
- Department of Psychology, University of California, Berkeley, CA 94704, USA
| | - Sonia J Bishop
- Helen Wills Neuroscience Institute, University of California, Berkeley, CA 94720, USA
- Department of Psychology, University of California, Berkeley, CA 94704, USA
| | - Anne G E Collins
- Helen Wills Neuroscience Institute, University of California, Berkeley, CA 94720, USA
- Department of Psychology, University of California, Berkeley, CA 94704, USA
| |
Collapse
|
49
|
Kutlu MG, Zachry JE, Melugin PR, Cajigas SA, Chevee MF, Kelly SJ, Kutlu B, Tian L, Siciliano CA, Calipari ES. Dopamine release in the nucleus accumbens core signals perceived saliency. Curr Biol 2021; 31:4748-4761.e8. [PMID: 34529938 PMCID: PMC9084920 DOI: 10.1016/j.cub.2021.08.052] [Citation(s) in RCA: 130] [Impact Index Per Article: 32.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2020] [Revised: 07/15/2021] [Accepted: 08/18/2021] [Indexed: 12/01/2022]
Abstract
A large body of work has aimed to define the precise information encoded by dopaminergic projections innervating the nucleus accumbens (NAc). Prevailing models are based on reward prediction error (RPE) theory, in which dopamine updates associations between rewards and predictive cues by encoding perceived errors between predictions and outcomes. However, RPE cannot describe multiple phenomena to which dopamine is inextricably linked, such as behavior driven by aversive and neutral stimuli. We combined a series of behavioral tasks with direct, subsecond dopamine monitoring in the NAc of mice, machine learning, computational modeling, and optogenetic manipulations to describe behavior and related dopamine release patterns across multiple contingencies reinforced by differentially valenced outcomes. We show that dopamine release only conforms to RPE predictions in a subset of learning scenarios but fits valence-independent perceived saliency encoding across conditions. Here, we provide an extended, comprehensive framework for accumbal dopamine release in behavioral control.
Collapse
Affiliation(s)
- Munir Gunes Kutlu
- Department of Pharmacology, Vanderbilt University, Nashville, TN 37232, USA
| | - Jennifer E Zachry
- Department of Pharmacology, Vanderbilt University, Nashville, TN 37232, USA
| | - Patrick R Melugin
- Vanderbilt Brain Institute, Vanderbilt University, Nashville, TN 37232, USA
| | - Stephanie A Cajigas
- Department of Pharmacology, Vanderbilt University, Nashville, TN 37232, USA; Vanderbilt Brain Institute, Vanderbilt University, Nashville, TN 37232, USA
| | - Maxime F Chevee
- Department of Pharmacology, Vanderbilt University, Nashville, TN 37232, USA
| | - Shannon J Kelly
- Department of Pharmacology, Vanderbilt University, Nashville, TN 37232, USA
| | - Banu Kutlu
- Department of Pharmacology, Vanderbilt University, Nashville, TN 37232, USA; Libraries Strategic Technologies, Penn State University Libraries, University Park, PA 16802, USA
| | - Lin Tian
- Department of Biochemistry and Molecular Medicine, University of California, Davis, Sacramento, CA 95817, USA
| | - Cody A Siciliano
- Department of Pharmacology, Vanderbilt University, Nashville, TN 37232, USA; Vanderbilt Brain Institute, Vanderbilt University, Nashville, TN 37232, USA; Vanderbilt Center for Addiction Research, Vanderbilt University, Nashville, TN 37232, USA
| | - Erin S Calipari
- Department of Pharmacology, Vanderbilt University, Nashville, TN 37232, USA; Vanderbilt Brain Institute, Vanderbilt University, Nashville, TN 37232, USA; Vanderbilt Center for Addiction Research, Vanderbilt University, Nashville, TN 37232, USA; Department of Molecular Physiology and Biophysics, Vanderbilt University, Nashville, TN 37232, USA; Department of Psychiatry and Behavioral Sciences, Vanderbilt University, Nashville, TN 37232, USA.
| |
Collapse
|
50
|
Klee JL, Souza BC, Battaglia FP. Learning differentially shapes prefrontal and hippocampal activity during classical conditioning. eLife 2021; 10:e65456. [PMID: 34665131 PMCID: PMC8545395 DOI: 10.7554/elife.65456] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2020] [Accepted: 10/10/2021] [Indexed: 11/25/2022] Open
Abstract
The ability to use sensory cues to inform goal-directed actions is a critical component of behavior. To study how sounds guide anticipatory licking during classical conditioning, we employed high-density electrophysiological recordings from the hippocampal CA1 area and the prefrontal cortex (PFC) in mice. CA1 and PFC neurons undergo distinct learning-dependent changes at the single-cell level and maintain representations of cue identity at the population level. In addition, reactivation of task-related neuronal assemblies during hippocampal awake Sharp-Wave Ripples (aSWRs) changed within individual sessions in CA1 and over the course of multiple sessions in PFC. Despite both areas being highly engaged and synchronized during the task, we found no evidence for coordinated single cell or assembly activity during conditioning trials or aSWR. Taken together, our findings support the notion that persistent firing and reactivation of task-related neural activity patterns in CA1 and PFC support learning during classical conditioning.
Collapse
Affiliation(s)
- Jan L Klee
- Donders Institute for Brain, Cognition and Behaviour, Radboud University, Nijmegen, Netherlands
| | - Bryan C Souza
- Donders Institute for Brain, Cognition and Behaviour, Radboud University, Nijmegen, Netherlands
| | - Francesco P Battaglia
- Donders Institute for Brain, Cognition and Behaviour, Radboud University, Nijmegen, Netherlands
| |
Collapse
|