1
Pool ER, Pauli WM, Cross L, O'Doherty JP. Neural substrates of parallel devaluation-sensitive and devaluation-insensitive Pavlovian learning in humans. bioRxiv 2023:2023.01.26.525637. [PMID: 36747799 PMCID: PMC9901183 DOI: 10.1101/2023.01.26.525637] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Indexed: 01/30/2023]
Abstract
Pavlovian learning depends on multiple and parallel associations leading to distinct classes of conditioned responses that vary in their flexibility following changes in the value of an associated outcome. Here, we aimed to differentiate brain areas involved in learning and encoding associations that are sensitive to changes in the value of an outcome from those that are not sensitive to such changes. To address this question, we combined a Pavlovian learning task with outcome devaluation, eye-tracking and functional magnetic resonance imaging. We used computational modeling to identify brain regions involved in learning stimulus-reward associations and stimulus-stimulus associations, by testing for brain areas correlating with reward-prediction errors and state-prediction errors, respectively. We found that, contrary to theoretical predictions about reward prediction errors being exclusively model-free, voxels correlating with reward prediction errors in the ventral striatum and subgenual anterior cingulate cortex were sensitive to devaluation. On the other hand, brain areas correlating with state prediction errors were found to be devaluation insensitive. In a supplementary analysis, we distinguished brain regions encoding predictions about outcome taste identity from those involved in encoding predictions about its expected spatial location. A subset of regions involved in taste identity predictions were devaluation sensitive while those involved in encoding predictions about spatial location were devaluation insensitive. These findings provide insights into the role of multiple associative mechanisms in the brain in mediating Pavlovian conditioned behavior - illustrating how distinct neural pathways can in parallel produce both devaluation sensitive and devaluation insensitive behaviors.
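The distinction between reward-prediction errors and state-prediction errors can be made concrete with a small delta-rule sketch. This is not the authors' model: the learning rate `alpha`, the trial format, and all function names are assumptions chosen for illustration. The reward-prediction error is the received reward minus the cue's current value estimate, and the state-prediction error is one minus the currently estimated probability of the state that actually followed the cue.

```python
# Illustrative sketch only: alpha, the trial format and all names are assumptions.

def reward_prediction_error(value, reward):
    """RPE: received reward minus the cue's current value estimate."""
    return reward - value


def state_prediction_error(transition_prob):
    """SPE: 1 minus the estimated probability of the observed next state."""
    return 1.0 - transition_prob


def run_trials(trials, alpha=0.2):
    """trials: list of (cue, next_state, reward). Returns per-trial (rpe, spe)."""
    values = {}       # cue -> estimated reward value
    transitions = {}  # cue -> {next_state: estimated probability}
    errors = []
    for cue, next_state, reward in trials:
        v = values.get(cue, 0.0)
        probs = transitions.setdefault(cue, {})
        rpe = reward_prediction_error(v, reward)
        spe = state_prediction_error(probs.get(next_state, 0.0))
        # Delta-rule update of the cue's reward value.
        values[cue] = v + alpha * rpe
        # Shrink every stored transition estimate, then add mass to the
        # observed state; for that state this equals p + alpha * spe.
        for state in probs:
            probs[state] *= 1.0 - alpha
        probs[next_state] = probs.get(next_state, 0.0) + alpha
        errors.append((rpe, spe))
    return errors
```

In this sketch both error signals shrink as a cue is repeatedly followed by the same state and reward. Only the value estimate carries information about how much the outcome is worth, while the transition estimate tracks which state follows the cue regardless of its value.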
2
Ioannidou C, Busquets-Garcia A, Ferreira G, Marsicano G. Neural Substrates of Incidental Associations and Mediated Learning: The Role of Cannabinoid Receptors. Front Behav Neurosci 2021; 15:722796. [PMID: 34421557 PMCID: PMC8378742 DOI: 10.3389/fnbeh.2021.722796] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/09/2021] [Accepted: 07/14/2021] [Indexed: 11/13/2022] Open
Abstract
The ability to form associations between different stimuli in the environment to guide adaptive behavior is a central element of learning processes, from perceptual learning in humans to Pavlovian conditioning in animals. Like so, classical conditioning paradigms that test direct associations between low salience sensory stimuli and high salience motivational reinforcers are extremely informative. However, a large part of everyday learning cannot be solely explained by direct conditioning mechanisms - this includes to a great extent associations between individual sensory stimuli, carrying low or null immediate motivational value. This type of associative learning is often described as incidental learning and can be captured in animal models through sensory preconditioning procedures. Here we summarize the evolution of research on incidental and mediated learning, overview the brain systems involved and describe evidence for the role of cannabinoid receptors in such higher-order learning tasks. This evidence favors a number of contemporary hypotheses concerning the participation of the endocannabinoid system in psychosis and psychotic experiences and provides a conceptual framework for understanding how the use of cannabinoid drugs can lead to altered perceptive states.
Affiliation(s)
- Christina Ioannidou
  - INSERM, U1215 Neurocentre Magendie, Bordeaux, France
  - University of Bordeaux, Bordeaux, France
- Arnau Busquets-Garcia
  - Integrative Pharmacology and Systems Neuroscience Research Group, Neurosciences Research Program, IMIM (Hospital del Mar Medical Research Institute), Barcelona, Spain
- Guillaume Ferreira
  - University of Bordeaux, Bordeaux, France
  - INRAE, Nutrition and Integrative Neurobiology, Bordeaux, France
- Giovanni Marsicano
  - INSERM, U1215 Neurocentre Magendie, Bordeaux, France
  - University of Bordeaux, Bordeaux, France
3
Mizunami M. What Is Learned in Pavlovian Conditioning in Crickets? Revisiting the S-S and S-R Learning Theories. Front Behav Neurosci 2021; 15:661225. [PMID: 34177477 PMCID: PMC8225941 DOI: 10.3389/fnbeh.2021.661225] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Received: 01/30/2021] [Accepted: 05/10/2021] [Indexed: 11/17/2022] Open
Abstract
In Pavlovian conditioning in mammals, two theories have been proposed for associations underlying conditioned responses (CRs). One theory, called S-S theory, assumes an association between a conditioned stimulus (CS) and internal representation of an unconditioned stimulus (US), allowing the animal to adjust the CR depending on the current value of the US. The other theory, called S-R theory, assumes an association or connection between the CS center and the CR center, allowing the CS to elicit the CR. Whether these theories account for Pavlovian conditioning in invertebrates has remained unclear. In this article, results of our studies in the cricket Gryllus bimaculatus are reviewed. We showed that after a standard amount of Pavlovian training, crickets exhibited no response to odor CS when water US was devalued by providing it until satiation, whereas after extended training, they exhibited a CR after US devaluation. An increase of behavioral automaticity by extended training has not been reported in Pavlovian conditioning in any other animals, but it has been documented in instrumental conditioning in mammals. Our pharmacological analysis suggested that octopamine neurons mediate US (water) value signals and control execution of the CR after standard training. The control, however, diminishes with extension of training and hence the CR becomes insensitive to the US value. We also found that the nature of the habitual response after extended Pavlovian training in crickets is not the same as that after extended instrumental training in mammals concerning the context specificity. Adaptive significance and evolutionary implications for our findings are discussed.
4
Panayi MC, Killcross S. The Role of the Rodent Lateral Orbitofrontal Cortex in Simple Pavlovian Cue-Outcome Learning Depends on Training Experience. Cereb Cortex Commun 2021; 2:tgab010. [PMID: 34296155 PMCID: PMC8152875 DOI: 10.1093/texcom/tgab010] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 11/23/2020] [Revised: 01/29/2021] [Accepted: 02/01/2021] [Indexed: 11/30/2022] Open
Abstract
The orbitofrontal cortex (OFC) is a critical structure in the flexible control of value-based behaviors. OFC dysfunction is typically only detected when task or environmental contingencies change, against a backdrop of apparently intact initial acquisition and behavior. While intact acquisition following OFC lesions in simple Pavlovian cue-outcome conditioning is often predicted by models of OFC function, this predicted null effect has not been thoroughly investigated. Here, we test the effects of lesions and temporary muscimol inactivation of the rodent lateral OFC on the acquisition of a simple single cue-outcome relationship. Surprisingly, pretraining lesions significantly enhanced acquisition after overtraining, whereas post-training lesions and inactivation significantly impaired acquisition. This impaired acquisition to the cue reflects a disruption of behavioral control and not learning since the cue could also act as an effective blocking stimulus in an associative blocking procedure. These findings suggest that even simple cue-outcome representations acquired in the absence of OFC function are impoverished. Therefore, while OFC function is often associated with flexible behavioral control in complex environments, it is also involved in very simple Pavlovian acquisition where complex cue-outcome relationships are irrelevant to task performance.
Affiliation(s)
- Marios C Panayi
  - School of Psychology, UNSW Sydney, Sydney, NSW 2052, Australia
  - National Institute on Drug Abuse Intramural Research Program, Cellular Neurobiology Research Branch, Behavioral Neurophysiology Research Section, 251 Bayview Blvd., Baltimore, MD 21224, USA
- Simon Killcross
  - School of Psychology, UNSW Sydney, Sydney, NSW 2052, Australia
5
Wukitsch TJ, Brase EC, Moser TJ, Kiefer SW, Cain ME. Differential rearing alters taste reactivity to ethanol, sucrose, and quinine. Psychopharmacology (Berl) 2020; 237:583-597. [PMID: 31832722 PMCID: PMC7747299 DOI: 10.1007/s00213-019-05394-x] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Received: 06/05/2019] [Accepted: 11/08/2019] [Indexed: 12/23/2022]
Abstract
RATIONALE: Early-life environment influences reinforcer and drug motivation in adulthood; however, the impact on specific components of motivation, including hedonic value ("liking"), remains unknown. OBJECTIVES: The current study determined whether differential rearing alters liking and aversive responding to ethanol, sucrose, and quinine in an ethanol-naïve rat model. METHODS: Male and female rats were reared for 30 days starting at postnatal day 21 in either an enriched (EC), isolated (IC), or standard condition (SC). Thereafter, all rats had indwelling intraoral fistulae implanted and their taste reactivity to water, ethanol (5, 10, 20, 30, 40% v/v), sucrose (0.1, 0.25, 0.5 M), and quinine (0.1, 0.5 mM) was recorded and analyzed. RESULTS: EC rats had higher amounts of liking responses to ethanol, sucrose, and quinine and higher amounts of aversive responses to ethanol and quinine compared to IC rats. While EC and IC rats' responses were different from each other, they both tended to be similar to SCs, who fell in between the EC and IC groups. CONCLUSIONS: These results suggest that environmental enrichment may enhance sensitivity to a variety of tastants, thereby enhancing liking, while isolation may dull sensitivity, thereby dulling liking. Altogether, the evidence suggests that isolated rats have a shift in the allostatic set-point which may, in part, drive increased responding for a variety of rewards including ethanol and sucrose. Enriched rats have enhanced liking of both sucrose and ethanol suggesting that enrichment may offer a unique phenotype with divergent preferences for incentive motivation.
Affiliation(s)
- Thomas J. Wukitsch
  - Department of Psychological Sciences, Kansas State University, 492 Bluemont Hall, 1114 Mid-Campus Dr N, Manhattan, KS 66506-5302, USA
- Emma C. Brase
  - Department of Psychological Sciences, Kansas State University, 492 Bluemont Hall, 1114 Mid-Campus Dr N, Manhattan, KS 66506-5302, USA
- Theodore J. Moser
  - Department of Psychological Sciences, Kansas State University, 492 Bluemont Hall, 1114 Mid-Campus Dr N, Manhattan, KS 66506-5302, USA
- Stephen W. Kiefer
  - Department of Psychological Sciences, Kansas State University, 492 Bluemont Hall, 1114 Mid-Campus Dr N, Manhattan, KS 66506-5302, USA
- Mary E. Cain
  - Department of Psychological Sciences, Kansas State University, 492 Bluemont Hall, 1114 Mid-Campus Dr N, Manhattan, KS 66506-5302, USA
6
Mizunami M, Hirohata S, Sato A, Arai R, Terao K, Sato M, Matsumoto Y. Development of behavioural automaticity by extended Pavlovian training in an insect. Proc Biol Sci 2020; 286:20182132. [PMID: 30963861 DOI: 10.1098/rspb.2018.2132] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Indexed: 11/12/2022] Open
Abstract
The effect of repetitive training on learned actions has been a major subject in behavioural neuroscience. Many studies of instrumental conditioning in mammals, including humans, suggested that learned actions early in training are goal-driven and controlled by outcome expectancy, but they become more automatic and insensitive to reduction in the value of the outcome after extended training. It was unknown, however, whether the development of value-insensitive behaviour also occurs by extended training of Pavlovian conditioning in any animals. Here we show that crickets Gryllus bimaculatus that had received minimal training to associate an odour with water (unconditioned stimulus, US) did not exhibit conditioned response (CR) to the odour when they were given water until satiation before the test, but those that had received extended training exhibited CR even when they were satiated with water. Further pharmacological experiments suggested that octopamine neurons, the invertebrate counterparts of noradrenaline neurons, mediate US value signals and control execution of CR after minimal training, but the control diminishes with the progress of training and hence the CR becomes insensitive to US devaluation. The results suggest that repetitive sensory experiences can lead to a change from a goal-driven response to a more automatic one in crickets.
Affiliation(s)
- Makoto Mizunami
  - Faculty of Science, Hokkaido University, Sapporo 060-0810, Japan
- Sho Hirohata
  - Graduate School of Life Science, Hokkaido University, Sapporo 060-0810, Japan
- Ai Sato
  - Graduate School of Life Sciences, Tohoku University, Sendai 980-8577, Japan
- Ryoichi Arai
  - Department of Biological Sciences, Hokkaido University, Sapporo 060-0810, Japan
- Kanta Terao
  - Graduate School of Life Science, Hokkaido University, Sapporo 060-0810, Japan
- Misato Sato
  - Graduate School of Life Science, Hokkaido University, Sapporo 060-0810, Japan
- Yukihisa Matsumoto
  - Faculty of Science, Hokkaido University, Sapporo 060-0810, Japan
  - College of Liberal Arts and Science, Tokyo Medical and Dental University, Ichikawa 272-0827, Japan
7
Fry BR, Russell N, Gifford R, Robles CF, Manning CE, Sawa A, Niwa M, Johnson AW. Assessing Reality Testing in Mice Through Dopamine-Dependent Associatively Evoked Processing of Absent Gustatory Stimuli. Schizophr Bull 2020; 46:54-67. [PMID: 31150554 PMCID: PMC6942166 DOI: 10.1093/schbul/sbz043] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Indexed: 11/14/2022]
Abstract
Impairments in reality testing are core features of numerous neuropsychiatric conditions. However, relatively few animal models have been developed to assess this critical facet of neuropsychiatric illness, thus impeding our understanding of the underlying central systems and circuits. Using mice in which dominant-negative Disrupted-in-Schizophrenia-1 is expressed throughout central nervous system circuitry (DN-DISC1-PrP), the capacity for an auditory conditioned stimulus (CS) to evoke perceptual processing of an absent sucrose solution was examined. At test, during CS presentations, DN-DISC1-PrP mice consumed more water and displayed a licking profile that is more typically revealed while ingesting a sweet-tasting solution. DN-DISC1-PrP mice also displayed greater c-fos expression in the insular (gustatory) cortex when consuming water in the presence of the CS. This capacity for the CS to more readily substitute for the taste features of the absent sucrose solution in DN-DISC1-PrP mice was attenuated following systemic treatment with the antipsychotic haloperidol. Conversely, social isolation during adolescence promoted the manifestation of these effects. These results provide strong validation for using associative learning procedures to examine dopamine-mediated reality testing associated with insular cortex activation.
Affiliation(s)
- Benjamin R Fry
  - Department of Psychology, Michigan State University, East Lansing, MI
- Nicollette Russell
  - Department of Psychology, Michigan State University, East Lansing, MI
  - Neuroscience Program, Michigan State University, East Lansing, MI
- Ryan Gifford
  - Department of Psychology, Michigan State University, East Lansing, MI
- Cindee F Robles
  - Department of Psychology, Michigan State University, East Lansing, MI
- Claire E Manning
  - Neuroscience Program, Michigan State University, East Lansing, MI
- Akira Sawa
  - Department of Psychiatry, Johns Hopkins University School of Medicine, Baltimore, MD
  - Department of Neuroscience, Biomedical Engineering, and Genetic Medicine, Johns Hopkins University School of Medicine
  - Department of Mental Health, Johns Hopkins University Bloomberg School of Public Health, Baltimore, MD
- Minae Niwa
  - Department of Psychiatry, Johns Hopkins University School of Medicine, Baltimore, MD
  - Present address: Department of Psychiatry and Behavioral Neurobiology, School of Medicine, The University of Alabama at Birmingham, Birmingham, AL
- Alexander W Johnson
  - Department of Psychology, Michigan State University, East Lansing, MI
  - Neuroscience Program, Michigan State University, East Lansing, MI
  - To whom correspondence should be addressed; tel: +1-517-432-8446; fax: +1-517-432-4744, e-mail:
8
Pool ER, Pauli WM, Kress CS, O'Doherty JP. Behavioural evidence for parallel outcome-sensitive and outcome-insensitive Pavlovian learning systems in humans. Nat Hum Behav 2019; 3:284-296. [PMID: 30882043 PMCID: PMC6416744 DOI: 10.1038/s41562-018-0527-9] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Received: 03/20/2018] [Accepted: 12/21/2018] [Indexed: 02/07/2023]
Abstract
There is a dichotomy in instrumental conditioning between goal-directed actions and habits that are distinguishable on the basis of their relative sensitivity to changes in outcome value. It is less clear whether a similar distinction applies in Pavlovian conditioning, where responses have been found to be predominantly outcome sensitive. To test for both devaluation insensitive and devaluation sensitive Pavlovian conditioning in humans, we conducted four experiments combining Pavlovian conditioning and outcome devaluation procedures while measuring multiple conditioned responses. Our results suggest that Pavlovian conditioning involves two distinct types of learning: one that learns the current value of the outcome which is sensitive to devaluation, and one that learns about the spatial localisation of the outcome which is insensitive to devaluation. Our findings have implications for the mechanistic understanding of Pavlovian conditioning and provide a more nuanced understanding of Pavlovian mechanisms that might contribute to a number of psychiatric disorders.
Affiliation(s)
- Eva R Pool
  - Division of the Humanities and Social Sciences, California Institute of Technology, Pasadena, CA, USA
- Wolfgang M Pauli
  - Division of the Humanities and Social Sciences, California Institute of Technology, Pasadena, CA, USA
  - Computation and Neural Systems Program, California Institute of Technology, Pasadena, CA, USA
- Carolina S Kress
  - Division of the Humanities and Social Sciences, California Institute of Technology, Pasadena, CA, USA
- John P O'Doherty
  - Division of the Humanities and Social Sciences, California Institute of Technology, Pasadena, CA, USA
  - Computation and Neural Systems Program, California Institute of Technology, Pasadena, CA, USA
9
Keiflin R, Pribut HJ, Shah NB, Janak PH. Ventral Tegmental Dopamine Neurons Participate in Reward Identity Predictions. Curr Biol 2018; 29:93-103.e3. [PMID: 30581025 DOI: 10.1016/j.cub.2018.11.050] [Citation(s) in RCA: 54] [Impact Index Per Article: 9.0] [Received: 06/21/2018] [Revised: 10/17/2018] [Accepted: 11/20/2018] [Indexed: 02/07/2023]
Abstract
Dopamine (DA) neurons in the ventral tegmental area (VTA) and substantia nigra (SNc) encode reward prediction errors (RPEs) and are proposed to mediate error-driven learning. However, the learning strategy engaged by DA-RPEs remains controversial. RPEs might imbue predictive cues with pure value, independently of representations of their associated outcome. Alternatively, RPEs might promote learning about the sensory features (the identity) of the rewarding outcome. Here, we show that, although both VTA and SNc DA neuron activation reinforces instrumental responding, only VTA DA neuron activation during consumption of expected sucrose reward restores error-driven learning and promotes formation of a new cue→sucrose association. Critically, expression of VTA DA-dependent Pavlovian associations is abolished following sucrose devaluation, a signature of identity-based learning. These findings reveal that activation of VTA- or SNc-DA neurons engages largely dissociable learning processes with VTA-DA neurons capable of participating in outcome-specific predictive learning, and the role of SNc-DA neurons appears limited to reinforcement of instrumental responses.
Affiliation(s)
- Ronald Keiflin
  - Department of Psychological and Brain Sciences, Krieger School of Arts and Sciences, Johns Hopkins University, Baltimore, MD 21218, USA
- Heather J Pribut
  - Department of Psychological and Brain Sciences, Krieger School of Arts and Sciences, Johns Hopkins University, Baltimore, MD 21218, USA
- Nisha B Shah
  - Department of Psychological and Brain Sciences, Krieger School of Arts and Sciences, Johns Hopkins University, Baltimore, MD 21218, USA
- Patricia H Janak
  - Department of Psychological and Brain Sciences, Krieger School of Arts and Sciences, Johns Hopkins University, Baltimore, MD 21218, USA
  - The Solomon H. Snyder Department of Neuroscience, Johns Hopkins School of Medicine, Johns Hopkins University, Baltimore, MD 21205, USA
  - Kavli Neuroscience Discovery Institute, Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA
10
Contributions of Pavlovian incentive motivation to cue-potentiated feeding. Sci Rep 2018; 8:2766. [PMID: 29426846 PMCID: PMC5807356 DOI: 10.1038/s41598-018-21046-0] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Received: 10/16/2017] [Accepted: 01/29/2018] [Indexed: 11/25/2022] Open
Abstract
Cues signaling the availability of palatable food acquire the ability to potentiate food seeking and consumption. The current study employed a combination of behavioral, pharmacological, and analytical techniques to probe the role of Pavlovian incentive motivation in cue-potentiated feeding. We show that a cue paired with sucrose solution (CS+) can transfer its control over feeding to stimulate sucrose consumption at a new receptacle, and that this effect depends on activation of D1 dopamine receptors, which is known to modulate other forms of cue-motivated behavior but not taste palatability. Microstructural analyses of sucrose-licking behavior revealed that the CS+ tended to increase the frequency with which rats engaged in active bouts of licking behavior without having a reliable effect on the duration of those licking bouts, a measure that was instead associated with sucrose palatability. Furthermore, we found that individual differences in CS+ elicited increases in bout frequency were associated with total sucrose intake at test, supporting the view that this process was related to meaningful dysregulation of eating behavior. The current study, therefore, (1) demonstrates that a dopamine-dependent Pavlovian incentive motivational process can mediate cue-potentiated feeding, and (2) lays out an experimental and analytical approach for parsing this aspect of behavior.
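The microstructural measures named in this abstract, bout frequency and bout duration, can be derived from a list of lick timestamps. The sketch below is a minimal illustration and not the study's analysis pipeline: the 1 s pause criterion (`max_gap`) and the function names are assumptions, and published work uses a range of bout criteria.

```python
# Illustrative sketch only: the pause criterion and names are assumptions.

def lick_bouts(lick_times, max_gap=1.0):
    """Group sorted lick timestamps (s) into bouts.

    A pause longer than max_gap seconds ends the current bout.
    """
    bouts = []
    for t in lick_times:
        if bouts and t - bouts[-1][-1] <= max_gap:
            bouts[-1].append(t)
        else:
            bouts.append([t])
    return bouts


def bout_summary(lick_times, max_gap=1.0):
    """Return the number of bouts and their mean duration in seconds."""
    bouts = lick_bouts(lick_times, max_gap)
    durations = [b[-1] - b[0] for b in bouts]
    mean_duration = sum(durations) / len(durations) if durations else 0.0
    return {"n_bouts": len(bouts), "mean_duration": mean_duration}
```

Comparing `n_bouts` and `mean_duration` between cue and no-cue periods separates how often licking is initiated from how long each bout lasts once it has started.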
11
Abstract
Powers et al. (2017, Science, 357(6351), 596-600) report that Pavlovian conditioning can result in the perception of a stimulus in its absence, and that this effect is related to hallucinations outside the laboratory. Considered alongside similar studies in animals, this suggests that associatively produced perceptual processing offers a means to study hallucination-like behaviour in the animal laboratory.
12
Johnson AW. Examining the influence of CS duration and US density on cue-potentiated feeding through analyses of licking microstructure. Learn Motiv 2018; 61:85-96. [PMID: 30082927 PMCID: PMC6075650 DOI: 10.1016/j.lmot.2017.07.001] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Indexed: 11/18/2022]
Abstract
In the current study, groups of mice were trained with either short (20 s) or long (120 s) conditioned stimulus (CS) durations associated with different rates of sucrose unconditioned stimulus (US) delivery, to examine whether different behavioral forms of cue-potentiated feeding in sated mice would be evoked. In training mice received presentations of an auditory CS for 20 s during which a sucrose US was delivered at a density of 1/9 s (Group-20-s). A second group of mice received an auditory CS for 120 s and a US density of 1/49 s (Group-120-s). During training, a shorter CS duration and higher rate of US delivery resulted in greater acquisition of food cup responding, and during the test stage Group-20-s mice also displayed higher CS evoked lick rates, though all mice showed cue-potentiated feeding. An analysis of licking microstructure also revealed that Group-120-s mice displayed CS evoked licking behavior that reflected an increase in the perceived palatability of the sucrose US. These findings are discussed with respect to the influence of CS interval and US density on associatively activated sensory and affective representations of a US, and contrast mediated effects resulting from presentation of excitatory and inhibitory conditioned stimuli.
Affiliation(s)
- Alexander W Johnson
  - Department of Psychology and Neuroscience Program, Michigan State University, East Lansing, MI, 48864, USA
13
Model-based and model-free Pavlovian reward learning: revaluation, revision, and revelation. Cogn Affect Behav Neurosci 2015; 14:473-92. [PMID: 24647659 DOI: 10.3758/s13415-014-0277-8] [Citation(s) in RCA: 184] [Impact Index Per Article: 20.4] [Indexed: 12/27/2022]
Abstract
Evidence supports at least two methods for learning about reward and punishment and making predictions for guiding actions. One method, called model-free, progressively acquires cached estimates of the long-run values of circumstances and actions from retrospective experience. The other method, called model-based, uses representations of the environment, expectations, and prospective calculations to make cognitive predictions of future value. Extensive attention has been paid to both methods in computational analyses of instrumental learning. By contrast, although a full computational analysis has been lacking, Pavlovian learning and prediction has typically been presumed to be solely model-free. Here, we revise that presumption and review compelling evidence from Pavlovian revaluation experiments showing that Pavlovian predictions can involve their own form of model-based evaluation. In model-based Pavlovian evaluation, prevailing states of the body and brain influence value computations, and thereby produce powerful incentive motivations that can sometimes be quite new. We consider the consequences of this revised Pavlovian view for the computational landscape of prediction, response, and choice. We also revisit differences between Pavlovian and instrumental learning in the control of incentive motivation.
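The contrast between the two methods can be illustrated with a toy revaluation example. This is a sketch under stated assumptions and not the computational account developed in the paper: the dictionaries, the cue and outcome names, and the two functions are hypothetical. A model-free prediction reads a cached value that only changes through further experience with the cue, whereas a model-based prediction recomputes value from the predicted outcome and that outcome's current worth.

```python
# Illustrative sketch only: data structures and names are assumptions.

def model_free_value(cached_values, cue):
    """Model-free: return the cached value learned from past experience."""
    return cached_values[cue]


def model_based_value(outcome_probs, outcome_values, cue):
    """Model-based: expected current worth of the outcomes the cue predicts."""
    return sum(p * outcome_values[o] for o, p in outcome_probs[cue].items())


# After training, both systems agree that the tone is worth 1.0.
cached_values = {"tone": 1.0}
outcome_probs = {"tone": {"sucrose": 1.0}}
outcome_values = {"sucrose": 1.0}

# Devaluation (for example satiation) changes the outcome's current worth
# without any new pairing of the tone and the sucrose.
outcome_values["sucrose"] = 0.0

mf = model_free_value(cached_values, "tone")
mb = model_based_value(outcome_probs, outcome_values, "tone")
```

Here `mf` keeps its pre-devaluation value while `mb` drops to zero, which is the behavioural signature revaluation experiments use to tell the two kinds of prediction apart.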
14
Smith KS, Graybiel AM. Investigating habits: strategies, technologies and models. Front Behav Neurosci 2014; 8:39. [PMID: 24574988 PMCID: PMC3921576 DOI: 10.3389/fnbeh.2014.00039] [Citation(s) in RCA: 58] [Impact Index Per Article: 5.8] [Received: 12/13/2013] [Accepted: 01/25/2014] [Indexed: 12/27/2022] Open
Abstract
Understanding habits at a biological level requires a combination of behavioral observations and measures of ongoing neural activity. Theoretical frameworks as well as definitions of habitual behaviors emerging from classic behavioral research have been enriched by new approaches taking account of the identification of brain regions and circuits related to habitual behavior. Together, this combination of experimental and theoretical work has provided key insights into how brain circuits underlying action-learning and action-selection are organized, and how a balance between behavioral flexibility and fixity is achieved. New methods to monitor and manipulate neural activity in real time are allowing us to have a first look “under the hood” of a habit as it is formed and expressed. Here we discuss ideas emerging from such approaches. We pay special attention to the unexpected findings that have arisen from our own experiments suggesting that habitual behaviors likely require the simultaneous activity of multiple distinct components, or operators, seen as responsible for the contrasting dynamics of neural activity in both cortico-limbic and sensorimotor circuits recorded concurrently during different stages of habit learning. The neural dynamics identified thus far do not fully meet expectations derived from traditional models of the structure of habits, and the behavioral measures of habits that we have made also are not fully aligned with these models. We explore these new clues as opportunities to refine an understanding of habits.
Affiliation(s)
- Kyle S Smith
  - Department of Psychological and Brain Sciences, Dartmouth College, Hanover, NH, USA
- Ann M Graybiel
  - Department of Brain and Cognitive Sciences, McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA, USA
15
Abstract
Information is encoded in the nervous system through the discharge and synchronization of single neurons. The striatum, the input stage of the basal ganglia, is divided into three territories: the putamen, the caudate, and the ventral striatum, all of which converge onto the same motor pathway. This parallel organization suggests that there are multiple and competing systems in the basal ganglia network controlling behavior. To explore which mechanism(s) enables the different striatal domains to encode behavioral events and to control behavior, we compared the neural activity of phasically active neurons [medium spiny neurons (MSNs), presumed projection neurons] and tonically active neurons (presumed cholinergic interneurons) across striatal territories from monkeys during the performance of a well practiced task. Although neurons in all striatal territories displayed similar spontaneous discharge properties and similar temporal modulations of their discharge rates to the behavioral events, their correlation structure was profoundly different. The distributions of signal and noise correlation of pairs of putamen MSNs were strongly shifted toward positive correlations and these two measures were correlated. In contrast, MSN pairs in the caudate and ventral striatum displayed symmetrical, near-zero signal and noise correlation distributions. Furthermore, only putamen MSN pairs displayed different noise correlation dynamics to rewarding versus neutral/aversive cues. Similarly, the noise correlation between tonically active neuron pairs was stronger in the putamen than in the caudate. We suggest that the level of synchronization of the neuronal activity and its temporal dynamics differentiate the striatal territories and may thus account for the different roles that striatal domains play in behavioral control.
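Signal and noise correlations for a pair of simultaneously recorded neurons can be computed as follows. This is a minimal sketch of the standard definitions rather than the study's exact procedure: the input format (a dictionary from condition to per-trial firing rates) and the function names are assumptions. Signal correlation is the correlation of the two neurons' mean responses across conditions, and noise correlation is the correlation of their trial-by-trial deviations from those condition means.

```python
# Illustrative sketch only: input format and names are assumptions.
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length sequences (non-zero variance)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def signal_and_noise_correlation(rates_a, rates_b):
    """rates_a, rates_b: dict condition -> per-trial rates, same trial order."""
    means_a, means_b, resid_a, resid_b = [], [], [], []
    for cond in sorted(rates_a):
        ma = sum(rates_a[cond]) / len(rates_a[cond])
        mb = sum(rates_b[cond]) / len(rates_b[cond])
        means_a.append(ma)
        means_b.append(mb)
        # Residuals: trial-by-trial fluctuation around the condition mean.
        resid_a += [r - ma for r in rates_a[cond]]
        resid_b += [r - mb for r in rates_b[cond]]
    return pearson(means_a, means_b), pearson(resid_a, resid_b)
```

The two measures are independent in principle: a pair can share tuning across conditions (high signal correlation) while fluctuating in opposite directions from trial to trial (negative noise correlation).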
16
Intra-accumbens baclofen, but not muscimol, increases second order instrumental responding for food reward in rats. PLoS One 2012; 7:e40057. [PMID: 22808090 PMCID: PMC3392280 DOI: 10.1371/journal.pone.0040057] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Received: 11/11/2011] [Accepted: 05/31/2012] [Indexed: 12/15/2022]
Abstract
Stimulation of either GABAA or GABAB receptors within the nucleus accumbens shell strongly enhances food intake in rats. However, the effects of subtype-selective stimulation of GABA receptors on instrumental responses for food reward are less well characterized. Here we contrast the effects of the GABAA receptor agonist muscimol and the GABAB receptor agonist baclofen on instrumental responding for food using a second order reinforcement schedule. Bilateral intra-accumbens administration of baclofen (220–440 pmol) stimulated responding but a higher dose (660 pmol) induced stereotyped oral behaviour that interfered with responding. Baclofen (220–660 pmol) also stimulated intake of freely available chow. Muscimol (220–660 pmol) was without effect on responding for food on this schedule but did stimulate intake of freely available chow. Unilateral administration of either baclofen or muscimol (220 pmol) induced similar patterns of c-fos immunoreactivity in several hypothalamic sites but differed in its induction in the central nucleus of the amygdala. We conclude that stimulation of GABAA or GABAB receptors in the nucleus accumbens shell of rats produces clearly distinguishable effects on operant responding for food.
17
Abstract
A significant problem in the study of Pavlovian conditioning is characterizing the nature of the representations of events that enter into learning. This issue has been explored extensively with regard to the question of what features of the unconditioned stimulus enter into learning, but considerably less work has been directed to the question of characterizing the nature of the conditioned stimulus. This article introduces a multilayered connectionist network approach to understanding how "perceptual" or "conceptual" representations of the conditioned stimulus might emerge from conditioning and participate in various learning phenomena. The model is applied to acquired equivalence/distinctiveness of cue effects, as well as a variety of conditional discrimination learning tasks (patterning, biconditional, ambiguous occasion setting, feature discriminations). In addition, studies that have examined what aspects of the unconditioned stimulus enter into learning are also reviewed. Ultimately, it is concluded that adopting a multilayered connectionist network perspective of Pavlovian learning provides us with a richer way in which to view basic learning processes, but a number of key theoretical problems remain to be solved, particularly as they relate to the integration of what we know about the nature of the representations of conditioned and unconditioned stimuli.
18
Effects of systemic or nucleus accumbens-directed dopamine D1 receptor antagonism on sucrose seeking in rats. Psychopharmacology (Berl) 2011; 216:219-33. [PMID: 21318562 PMCID: PMC3120924 DOI: 10.1007/s00213-011-2210-y] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.2] [Received: 10/07/2010] [Accepted: 01/25/2011] [Indexed: 12/14/2022]
Abstract
RATIONALE Conditioned cues can elicit relapse to drug- and food-seeking behavior over prolonged periods of abstinence. If seeking behavior depends on mesolimbic dopamine D1 receptors, blocking these receptors should reduce seeking behavior. OBJECTIVES We examined the effects of either systemic or intra-nucleus accumbens administration of the D1 antagonist SCH 23390 on extinction responding (sucrose seeking) by rats either 1 or 30 days into forced abstinence. MATERIALS AND METHODS Rats self-administered 10% sucrose paired with a tone + light cue for 10 days. After either 1 or 30 days of forced abstinence, rats received systemic (0, 1, 5, or 25 μg/kg IP) or bilateral nucleus accumbens core or shell (0.3 or 0.6 μg/site) injections of SCH 23390 prior to extinction testing. RESULTS Saline-treated rats responded more during extinction following 30 vs. 1 day of forced abstinence ("incubation of craving"). Systemic SCH 23390 reduced sucrose seeking after 1 day of forced abstinence, significantly reducing responding following pretreatment with 1, 5, and 25 μg/kg SCH 23390, but only 25 μg/kg significantly reduced sucrose seeking after 30 days of forced abstinence. SCH 23390 (0.3 or 0.6 μg/site) in the core or shell of the nucleus accumbens reduced sucrose seeking in all groups. CONCLUSION Nucleus accumbens D1 receptors are involved in sucrose seeking, but it is not clear if they are involved in the incubation of craving. The fact that D1 antagonism reduced sucrose seeking across an extended period of abstinence may be of use for development of treatment strategies for relapse.
19
Harkness JH, Webb S, Grimm JW. Abstinence-dependent transfer of lithium chloride-induced sucrose aversion to a sucrose-paired cue in rats. Psychopharmacology (Berl) 2010; 208:521-30. [PMID: 20039021 PMCID: PMC2819267 DOI: 10.1007/s00213-009-1755-5] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.0] [Received: 06/30/2009] [Accepted: 12/10/2009] [Indexed: 10/20/2022]
Abstract
RATIONALE Responding for a drug- or sucrose-paired cue increases over forced abstinence (incubation of craving). If the incentive value of a cue depends on the incentive value of the primary reward, devaluing the primary reward should reduce cue reactivity. OBJECTIVES We investigated whether conditioned taste aversion (CTA) to sucrose would transfer to a sucrose-paired cue after 1 or 30 days of forced abstinence and whether CTA after 1 day of forced abstinence would affect incubation of craving. MATERIALS AND METHODS Rats self-administered 10% sucrose paired with a tone + light cue for 10 days. After 1 (Exp.1) or 30 (Exp.2) days of forced abstinence, rats received two home-cage pairings of sucrose with either LiCl (65 mg/kg, IP) to produce CTA or saline as a control. Two days later, rats responded for the cue alone. The following day, sucrose consumption was assessed in the same operant conditioning chamber. Exp.1 rats were tested again 1 month later to determine if CTA would affect incubation of craving. RESULTS Exp.1: CTA after 1 day of forced abstinence did not attenuate cue reactivity when tested immediately after CTA, nor did the treatment affect incubation of craving or incubation of sucrose consumption. Exp.2: CTA after 1 month of forced abstinence resulted in a significant reduction in cue reactivity. CONCLUSION The incentive values of sucrose and the conditioned representation of sucrose increase over an extended period of forced abstinence. This incubation appears to facilitate the transfer of an aversion to the primary reward to the conditioned cue.
20
Jonkman S, Kosaki Y, Everitt BJ, Dickinson A. The role of contextual conditioning in the effect of reinforcer devaluation on instrumental performance by rats. Behav Processes 2010; 83:276-81. [PMID: 20060882 DOI: 10.1016/j.beproc.2009.12.017] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Received: 07/13/2009] [Revised: 11/23/2009] [Accepted: 12/24/2009] [Indexed: 10/20/2022]
Abstract
Different groups of rats received different amounts of training to lever press for a food reinforcer before an aversion was conditioned to the food. This devaluation of the reinforcer reduced responding in both subsequent extinction and reinforced tests of responding to a degree that was independent of the amount of instrumental training. Moreover, interpolating context extinction between aversion conditioning and the extinction test reduced the magnitude of the devaluation effect, thereby indicating that Pavlovian contextual conditioning may play a role in the instrumental devaluation effect.
Affiliation(s)
- Sietse Jonkman
- Behavioural and Clinical Neuroscience Institute and Department of Experimental Psychology, University of Cambridge, Downing Site, Cambridge, United Kingdom.
21
Abstract
Pavlovian cues for rewards become endowed with incentive salience, guiding "wanting" to their learned reward. Usually, cues are "wanted" only if their rewards have ever been "liked," but here we show that mesocorticolimbic systems can recompute "wanting" de novo by integrating novel physiological signals with a cue's preexisting associations to an outcome that lacked hedonic value. That is, a cue's incentive salience can be recomputed adaptively. We demonstrate that this recomputation is encoded in neural signals coursing through the ventral pallidum. Ventral pallidum neurons do not ordinarily fire vigorously to a cue that predicts the previously "disliked" taste of intense salt, although they do fire to a cue that predicts the taste of previously "liked" sucrose. Yet we show that neural firing rises dramatically to the salt cue immediately and selectively when that cue is encountered in a never-before-experienced state of physiological salt depletion. Crucially, robust neural firing to the salt cue occurred the first time it was encountered in the new depletion state (in cue-only extinction trials), even before its associated intense saltiness has ever been tasted as positively "liked" (salt taste had always been "disliked" before). The amplification of incentive salience did not require additional learning about the cue or the newly positive salt taste. Thus dynamic recomputation of cue-triggered "wanting" signals can occur in real time at the moment of cue re-encounter by combining previously learned Pavlovian associations with novel physiological information about a current state of specific appetite.
22
Iguchi Y, Ishii K. Instrumental outcome devaluation with representation-mediated conditioning. Behav Processes 2009; 81:95-104. [PMID: 19429203 DOI: 10.1016/j.beproc.2009.02.014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/29/2008] [Revised: 01/20/2009] [Accepted: 02/20/2009] [Indexed: 11/19/2022]
Abstract
In three experiments, rats were trained to perform two instrumental behaviours (R1 and R2) in the presence of discriminative stimuli (Sd1 and Sd2, respectively) to obtain a common food outcome (O1). Acquisition of the two discriminations was followed by switching the outcome accompanying R2 performance from O1 to a new one (O2). Experiment 1 showed that paired presentations of O2 with a lithium chloride (LiCl) injection resulted in a reduction in R2 performance. In the subsequent two experiments, each Sd was paired with LiCl injection and its effects on outcome consumption and instrumental performance were investigated. A reduction in O2 consumption subsequent to the Sd devaluation was found in Experiments 2 and 3. Experiment 3 revealed reduced R2 performance in an extinction test, following the animals' consummatory access to the outcomes in the training context. These results demonstrate representation-mediated outcome devaluation in the course of the Sd devaluation.
Affiliation(s)
- Yoshio Iguchi
- Department of Psychology, School of Humanities, College of Human & Social Sciences, Kanazawa University, Kanazawa, Ishikawa, Japan.
23
Holland PC. A comparison of two methods of assessing representation-mediated food aversions based on shock or illness. LEARNING AND MOTIVATION 2008; 39:265-277. [PMID: 19884955 PMCID: PMC2598752 DOI: 10.1016/j.lmot.2008.08.001] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Indexed: 10/21/2022]
Abstract
In experiments that measured food consumption, Holland (1981; Learning and Motivation, 12, 1-18) found that food aversions were formed when an exteroceptive associate of food was paired with illness, but not when such an associate was paired with shock. By contrast, measuring the ability of food to reinforce instrumental responding, Ward-Robinson and Hall (1999; Quarterly Journal of Experimental Psychology, 52B, 335-350) found that pairing an associatively-activated representation of food with shock readily established an aversion to that food. Two experiments considered the origins of these apparently discrepant results. The results did not support either the possibility that instrumental reinforcement power is a more sensitive measure of aversion learning than consumption, or the hypothesis that illness particularly devalues properties of food representations that determine consumption (such as palatability) whereas shock devalues more general properties critical to reinforcement. The results suggested instead that whereas the effects of pairings of a food associate with illness are mediated by changes in the value of the food itself, the effects of pairings with shock are mediated by the conditioning of fear or other competing responses to the site of food delivery, and not by modification of the value of food itself.
24
Abstract
In his 1948 address to the Division of Theoretical-Experimental Psychology of the American Psychological Association, Kenneth W. Spence discussed six distinctions between cognitive and stimulus-response (S-R) theories of learning. In this article, I first review these six distinctions and then focus on two of them in the context of my own research. This research concerns the specification of stimulus-stimulus associations in associative learning and the characterization of the neural systems underlying those associations. In the course of describing Spence's views and my research, I hope to communicate some of the richness of Spence's S-R psychology and its currency within modern scientific analyses of behavior.
Affiliation(s)
- Peter C Holland
- Department of Psychological and Brain Sciences, Johns Hopkins University, Baltimore, Maryland 21218, USA.