1
|
Carter F, Cossette MP, Trujillo-Pisanty I, Pallikaras V, Breton YA, Conover K, Caplan J, Solis P, Voisard J, Yaksich A, Shizgal P. Does phasic dopamine release cause policy updates? Eur J Neurosci 2024; 59:1260-1277. [PMID: 38039083 DOI: 10.1111/ejn.16199] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2023] [Revised: 10/21/2023] [Accepted: 11/01/2023] [Indexed: 12/03/2023]
Abstract
Phasic dopamine activity is believed to both encode reward-prediction errors (RPEs) and to cause the adaptations that these errors engender. If so, a rat working for optogenetic stimulation of dopamine neurons will repeatedly update its policy and/or action values, thus iteratively increasing its work rate. Here, we challenge this view by demonstrating stable, non-maximal work rates in the face of repeated optogenetic stimulation of midbrain dopamine neurons. Furthermore, we show that rats learn to discriminate between world states distinguished only by their history of dopamine activation. Comparison of these results to reinforcement learning simulations suggests that the induced dopamine transients acted more as rewards than RPEs. However, pursuit of dopaminergic stimulation drifted upwards over a time scale of days and weeks, despite its stability within trials. To reconcile the results with prior findings, we consider multiple roles for dopamine signalling.
Collapse
Affiliation(s)
- Francis Carter
- Department of Psychology, Concordia University, Montreal, Quebec, Canada
- Montreal Institute for Learning Algorithms, Université de Montréal, Montreal, Quebec, Canada
| | | | - Ivan Trujillo-Pisanty
- Department of Psychology, Concordia University, Montreal, Quebec, Canada
- Department of Psychology, Langara College, Vancouver, British Columbia, Canada
| | | | | | - Kent Conover
- Department of Psychology, Concordia University, Montreal, Quebec, Canada
| | - Jill Caplan
- Department of Psychology, Concordia University, Montreal, Quebec, Canada
| | - Pavel Solis
- Department of Psychology, Concordia University, Montreal, Quebec, Canada
| | - Jacques Voisard
- Department of Psychology, Concordia University, Montreal, Quebec, Canada
| | - Alexandra Yaksich
- Department of Psychology, Concordia University, Montreal, Quebec, Canada
| | - Peter Shizgal
- Department of Psychology, Concordia University, Montreal, Quebec, Canada
| |
Collapse
|
2
|
Evangelista C, Mehrez N, Boisvert EE, Brake WG, Shizgal P. The priming effect of rewarding brain stimulation in rats depends on both the cost and strength of reward but survives blockade of D2-like dopamine receptors. Eur J Neurosci 2023; 58:3751-3784. [PMID: 37752810 DOI: 10.1111/ejn.16142] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2022] [Revised: 07/18/2023] [Accepted: 08/24/2023] [Indexed: 09/28/2023]
Abstract
Receipt of an intense reward boosts motivation to work for more of that reward. This phenomenon is called the priming effect of rewards. Using a novel measurement method, we show that the priming effect of rewarding electrical brain stimulation depends on the cost, as well as on the strength, of the anticipated reward. Previous research on the priming effect of electrical brain stimulation utilized a runway paradigm in which running speed serves as the measure of motivation. In the present study, the measure of motivation was the vigour with which rats executed a two-lever response chain, in a standard operant-conditioning chamber, to earn rewarding electrical stimulation of the lateral hypothalamus. In a second experiment, we introduced a modification that entails self-administered priming stimulation and alternating blocks of primed and unprimed trials. Reliable, consistent priming effects of substantial magnitude were obtained in the modified paradigm, which is closely analogous to the runway paradigm. In a third experiment, the modified paradigm served to assess the dependence of the priming effect on dopamine D2-like receptors. The priming effect proved resilient to the effect of eticlopride, a selective D2-like receptor antagonist. These results are discussed within the framework of a new model of brain reward circuitry in which non-dopaminergic medial forebrain bundle fibers and dopamine axons provide parallel inputs to the final common paths for reward and incentive motivation.
Collapse
Affiliation(s)
- Czarina Evangelista
- Centre for Studies in Behavioural Neurobiology, Concordia University, Montréal, Québec, Canada
| | - Norhan Mehrez
- Centre for Studies in Behavioural Neurobiology, Concordia University, Montréal, Québec, Canada
| | - Esthelle Ewusi Boisvert
- Centre for Studies in Behavioural Neurobiology, Concordia University, Montréal, Québec, Canada
| | - Wayne G Brake
- Centre for Studies in Behavioural Neurobiology, Concordia University, Montréal, Québec, Canada
| | - Peter Shizgal
- Centre for Studies in Behavioural Neurobiology, Concordia University, Montréal, Québec, Canada
| |
Collapse
|
3
|
Pallikaras V, Shizgal P. The Convergence Model of Brain Reward Circuitry: Implications for Relief of Treatment-Resistant Depression by Deep-Brain Stimulation of the Medial Forebrain Bundle. Front Behav Neurosci 2022; 16:851067. [PMID: 35431828 PMCID: PMC9011331 DOI: 10.3389/fnbeh.2022.851067] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2022] [Accepted: 02/10/2022] [Indexed: 12/28/2022] Open
Abstract
Deep-brain stimulation of the medial forebrain bundle (MFB) can provide effective, enduring relief of treatment-resistant depression. Panksepp provided an explanatory framework: the MFB constitutes the core of the neural circuitry subserving the anticipation and pursuit of rewards: the “SEEKING” system. On that view, the SEEKING system is hypoactive in depressed individuals; background electrical stimulation of the MFB alleviates symptoms by normalizing activity. Panksepp attributed intracranial self-stimulation to excitation of the SEEKING system in which the ascending projections of midbrain dopamine neurons are an essential component. In parallel with Panksepp’s qualitative work, intracranial self-stimulation has long been studied quantitatively by psychophysical means. That work argues that the predominant directly stimulated substrate for MFB self-stimulation are myelinated, non-dopaminergic fibers, more readily excited by brief electrical current pulses than the thin, unmyelinated axons of the midbrain dopamine neurons. The series-circuit hypothesis reconciles this view with the evidence implicating dopamine in MFB self-stimulation as follows: direct activation of myelinated MFB fibers is rewarding due to their trans-synaptic activation of midbrain dopamine neurons. A recent study in which rats worked for optogenetic stimulation of midbrain dopamine neurons challenges the series-circuit hypothesis and provides a new model of intracranial self-stimulation in which the myelinated non-dopaminergic neurons and the midbrain dopamine projections access the behavioral final common path for reward seeking via separate, converging routes. We explore the potential implications of this convergence model for the interpretation of the antidepressant effect of MFB stimulation. We also discuss the consistent finding that psychomotor stimulants, which boost dopaminergic neurotransmission, fail to provide a monotherapy for depression. We propose that non-dopaminergic MFB components may contribute to the therapeutic effect in parallel to, in synergy with, or even instead of, a dopaminergic component.
Collapse
|
4
|
Pallikaras V, Carter F, Velazquez-Martinez DN, Arvanitogiannis A, Shizgal P. The trade-off between pulse duration and power in optical excitation of midbrain dopamine neurons approximates Bloch's law. Behav Brain Res 2022; 419:113702. [PMID: 34864162 DOI: 10.1016/j.bbr.2021.113702] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2021] [Revised: 11/22/2021] [Accepted: 11/29/2021] [Indexed: 11/28/2022]
Abstract
Optogenetic experiments reveal functional roles of specific neurons. However, functional inferences have been limited by widespread adoption of a restricted set of stimulation parameters. Broader exploration of the parameter space can deepen insight into the mapping between selective neural activity and behavior. In this way, characteristics of the activated neural circuit, such as temporal integration, can be inferred. Our objective was to determine whether an equal-energy principle accounts for the interaction of pulse duration and optical power in optogenetic excitation. Six male TH::Cre rats worked for optogenetic (ChannelRhodopsin-2) stimulation of VTA dopamine neurons. We used a within-subject design to describe the trade-off between pulse duration and optical power in determining reward seeking. Parameters were customized for each subject based on behavioral effectiveness. Within a useful range of powers (~12.6-31.6 mW) the product of optical power and pulse duration required to produce a given level of reward seeking was roughly constant. Such reciprocity is consistent with Bloch's law, which posits an equal-energy principle of temporal summation over short durations in human vision. The trade-off between pulse duration and power broke down at higher powers. Thus, optical power and duration can be adjusted reciprocally for brief durations and lower powers, and power can be substituted for pulse duration to scale the region of excitation in behavioral optogenetic experiments. The findings demonstrate the utility of within-subject and trade-off designs in optogenetics and of parameter adjustment based on functional endpoints instead of physical properties of the stimulation.
Collapse
Affiliation(s)
- Vasilios Pallikaras
- Center for Studies in Behavioral Neurobiology, Department of Psychology, Concordia University, 7141 Sherbrooke St W., Montreal, QC H4B 1R6, Canada
| | - Francis Carter
- Center for Studies in Behavioral Neurobiology, Department of Psychology, Concordia University, 7141 Sherbrooke St W., Montreal, QC H4B 1R6, Canada
| | - David Natanael Velazquez-Martinez
- Center for Studies in Behavioral Neurobiology, Department of Psychology, Concordia University, 7141 Sherbrooke St W., Montreal, QC H4B 1R6, Canada
| | - Andreas Arvanitogiannis
- Center for Studies in Behavioral Neurobiology, Department of Psychology, Concordia University, 7141 Sherbrooke St W., Montreal, QC H4B 1R6, Canada
| | - Peter Shizgal
- Center for Studies in Behavioral Neurobiology, Department of Psychology, Concordia University, 7141 Sherbrooke St W., Montreal, QC H4B 1R6, Canada.
| |
Collapse
|
5
|
Trujillo-Pisanty I, Conover K, Solis P, Palacios D, Shizgal P. Dopamine neurons do not constitute an obligatory stage in the final common path for the evaluation and pursuit of brain stimulation reward. PLoS One 2020; 15:e0226722. [PMID: 32502210 PMCID: PMC7274413 DOI: 10.1371/journal.pone.0226722] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2019] [Accepted: 05/05/2020] [Indexed: 11/19/2022] Open
Abstract
The neurobiological study of reward was launched by the discovery of intracranial self-stimulation (ICSS). Subsequent investigation of this phenomenon provided the initial link between reward-seeking behavior and dopaminergic neurotransmission. We re-evaluated this relationship by psychophysical, pharmacological, optogenetic, and computational means. In rats working for direct, optical activation of midbrain dopamine neurons, we varied the strength and opportunity cost of the stimulation and measured time allocation, the proportion of trial time devoted to reward pursuit. We found that the dependence of time allocation on the strength and cost of stimulation was similar formally to that observed when electrical stimulation of the medial forebrain bundle served as the reward. When the stimulation is strong and cheap, the rats devote almost all their time to reward pursuit; time allocation falls off as stimulation strength is decreased and/or its opportunity cost is increased. A 3D plot of time allocation versus stimulation strength and cost produces a surface resembling the corner of a plateau (the “reward mountain”). We show that dopamine-transporter blockade shifts the mountain along both the strength and cost axes in rats working for optical activation of midbrain dopamine neurons. In contrast, the same drug shifted the mountain uniquely along the opportunity-cost axis when rats worked for electrical MFB stimulation in a prior study. Dopamine neurons are an obligatory stage in the dominant model of ICSS, which positions them at a key nexus in the final common path for reward seeking. This model fails to provide a cogent account for the differential effect of dopamine transporter blockade on the reward mountain. Instead, we propose that midbrain dopamine neurons and neurons with non-dopaminergic, MFB axons constitute parallel limbs of brain-reward circuitry that ultimately converge on the final-common path for the evaluation and pursuit of rewards.
Collapse
Affiliation(s)
- Ivan Trujillo-Pisanty
- Centre for Studies in Behavioural Neurobiology, Concordia University, Montreal, Québec, Canada
| | - Kent Conover
- Centre for Studies in Behavioural Neurobiology, Concordia University, Montreal, Québec, Canada
| | - Pavel Solis
- Centre for Studies in Behavioural Neurobiology, Concordia University, Montreal, Québec, Canada
| | - Daniel Palacios
- Centre for Studies in Behavioural Neurobiology, Concordia University, Montreal, Québec, Canada
| | - Peter Shizgal
- Centre for Studies in Behavioural Neurobiology, Concordia University, Montreal, Québec, Canada
- * E-mail:
| |
Collapse
|
6
|
Ahilan S, Solomon RB, Breton YA, Conover K, Niyogi RK, Shizgal P, Dayan P. Learning to use past evidence in a sophisticated world model. PLoS Comput Biol 2019; 15:e1007093. [PMID: 31233559 PMCID: PMC6611652 DOI: 10.1371/journal.pcbi.1007093] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2018] [Revised: 07/05/2019] [Accepted: 05/09/2019] [Indexed: 12/02/2022] Open
Abstract
Humans and other animals are able to discover underlying statistical structure in their environments and exploit it to achieve efficient and effective performance. However, such structure is often difficult to learn and use because it is obscure, involving long-range temporal dependencies. Here, we analysed behavioural data from an extended experiment with rats, showing that the subjects learned the underlying statistical structure, albeit suffering at times from immediate inferential imperfections as to their current state within it. We accounted for their behaviour using a Hidden Markov Model, in which recent observations are integrated with evidence from the past. We found that over the course of training, subjects came to track their progress through the task more accurately, a change that our model largely attributed to improved integration of past evidence. This learning reflected the structure of the task, decreasing reliance on recent observations, which were potentially misleading. Humans and other animals possess the remarkable ability to find and exploit patterns and structures in their experience of a complex and varied world. However, such structures are often temporally extended and latent or hidden, being only partially correlated with immediate observations of the world. This makes it essential to integrate current and historical information, and creates a challenging statistical and computational problem. Here, we examine the behaviour of rats facing a version of this challenge posed by a brain-stimulation reward task. We find that subjects learned the general structure of the task, but struggled when immediate observations were misleading. We captured this behaviour with a model in which subjects integrated evidence from recent observations together with evidence from the past. The subjects’ performance improved markedly over successive sessions, allowing them to overcome misleading observations. According to our model, this was made possible by more effective usage of past evidence to better determine the true state of the world.
Collapse
Affiliation(s)
- Sanjeevan Ahilan
- Gatsby Computational Neuroscience Unit, University College London, London, United Kingdom
- * E-mail:
| | - Rebecca B. Solomon
- Center for Studies in Behavioral Neurobiology, Concordia University, Montreal, Canada
| | - Yannick-André Breton
- Center for Studies in Behavioral Neurobiology, Concordia University, Montreal, Canada
| | - Kent Conover
- Center for Studies in Behavioral Neurobiology, Concordia University, Montreal, Canada
| | - Ritwik K. Niyogi
- Department of Experimental Psychology, University of Oxford, Oxford, United Kingdom
| | - Peter Shizgal
- Center for Studies in Behavioral Neurobiology, Concordia University, Montreal, Canada
| | - Peter Dayan
- Gatsby Computational Neuroscience Unit, University College London, London, United Kingdom
- Max Planck Institute for Biological Cybernetics, Tübingen, Germany
| |
Collapse
|
7
|
Solomon RB, Conover K, Shizgal P. Valuation of opportunity costs by rats working for rewarding electrical brain stimulation. PLoS One 2017; 12:e0182120. [PMID: 28841663 PMCID: PMC5571941 DOI: 10.1371/journal.pone.0182120] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2017] [Accepted: 07/12/2017] [Indexed: 11/29/2022] Open
Abstract
Pursuit of one goal typically precludes simultaneous pursuit of another. Thus, each exclusive activity entails an “opportunity cost:” the forgone benefits from the next-best activity eschewed. The present experiment estimates, in laboratory rats, the function that maps objective opportunity costs into subjective ones. In an operant chamber, rewarding electrical brain stimulation was delivered when the cumulative time a lever had been depressed reached a criterion duration. The value of the activities forgone during this duration is the opportunity cost of the electrical reward. We determined which of four functions best describes how objective opportunity costs, expressed as the required duration of lever depression, are translated into their subjective equivalents. The simplest account is the identity function, which equates subjective and objective opportunity costs. A variant of this function called the “sigmoidal-slope function,” converges on the identity function at longer durations but deviates from it at shorter durations. The sigmoidal-slope function has the form of a hockey stick. The flat “blade” denotes a range over which opportunity costs are subjectively equivalent; these durations are too short to allow substitution of more beneficial activities. The blade extends into an upward-curving portion over which costs become discriminable and finally into the straight “handle,” over which objective and subjective costs match. The two remaining functions are based on hyperbolic and exponential temporal discounting, respectively. The results are best described by the sigmoidal-slope function. That this is so suggests that different principles of intertemporal choice are involved in the evaluation of time spent working for a reward or waiting for its delivery. The subjective opportunity-cost function plays a key role in the evaluation and selection of goals. An accurate description of its form and parameters is essential to successful modeling and prediction of instrumental performance and reward-related decision making.
Collapse
Affiliation(s)
- Rebecca Brana Solomon
- Centre for Studies in Behavioural Neurobiology / Groupe de recherche en neurobiologie comportementale, Department of Psychology, Concordia University, Montréal, Québec, Canada
| | - Kent Conover
- Centre for Studies in Behavioural Neurobiology / Groupe de recherche en neurobiologie comportementale, Department of Psychology, Concordia University, Montréal, Québec, Canada
| | - Peter Shizgal
- Centre for Studies in Behavioural Neurobiology / Groupe de recherche en neurobiologie comportementale, Department of Psychology, Concordia University, Montréal, Québec, Canada
- * E-mail:
| |
Collapse
|
8
|
Waraczynski M. Toward a systems-oriented approach to the role of the extended amygdala in adaptive responding. Neurosci Biobehav Rev 2016; 68:177-194. [PMID: 27216212 DOI: 10.1016/j.neubiorev.2016.05.015] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2015] [Revised: 04/02/2016] [Accepted: 05/19/2016] [Indexed: 11/19/2022]
Abstract
Research into the structure and function of the basal forebrain macrostructure called the extended amygdala (EA) has recently seen considerable growth. This paper reviews that work, with the objectives of identifying underlying themes and developing a common goal towards which investigators of EA function might work. The paper begins with a brief review of the structure and the ontological and phylogenetic origins of the EA. It continues with a review of research into the role of the EA in both aversive and appetitive states, noting that these two seemingly disparate avenues of research converge on the concept of reinforcement - either negative or positive - of adaptive responding. These reviews lead to a proposal as to where the EA may fit in the organization of the basal forebrain, and an invitation to investigators to place their findings in a unifying conceptual framework of the EA as a collection of neural ensembles that mediate adaptive responding.
Collapse
Affiliation(s)
- Meg Waraczynski
- Department of Psychology, University of Wisconsin-Whitewater, 800 West Main Street, Whitewater, WI 53190, USA.
| |
Collapse
|
9
|
Scardochio T, Trujillo-Pisanty I, Conover K, Shizgal P, Clarke PBS. The Effects of Electrical and Optical Stimulation of Midbrain Dopaminergic Neurons on Rat 50-kHz Ultrasonic Vocalizations. Front Behav Neurosci 2015; 9:331. [PMID: 26696851 PMCID: PMC4672056 DOI: 10.3389/fnbeh.2015.00331] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2015] [Accepted: 11/18/2015] [Indexed: 01/11/2023] Open
Abstract
Rationale: Adult rats emit ultrasonic vocalizations (USVs) at around 50-kHz; these commonly occur in contexts that putatively engender positive affect. While several reports indicate that dopaminergic (DAergic) transmission plays a role in the emission of 50-kHz calls, the pharmacological evidence is mixed. Different modes of dopamine (DA) release (i.e., tonic and phasic) could potentially explain this discrepancy. Objective: To investigate the potential role of phasic DA release in 50-kHz call emission. Methods: In Experiment 1, USVs were recorded in adult male rats following unexpected electrical stimulation of the medial forebrain bundle (MFB). In parallel, phasic DA release in the nucleus accumbens (NAcc) was recorded using fast-scan cyclic voltammetry. In Experiment 2, USVs were recorded following response-contingent or non-contingent optogenetic stimulation of midbrain DAergic neurons. Four 20-s schedules of optogenetic stimulation were used: fixed-interval, fixed-time, variable-interval, and variable-time. Results: Brief electrical stimulation of the MFB increased both 50-kHz call rate and phasic DA release in the NAcc. During optogenetic stimulation sessions, rats initially called at a high rate comparable to that observed following reinforcers such as psychostimulants. Although optogenetic stimulation maintained reinforced responding throughout the 2-h session, the call rate declined to near zero within the first 30 min. The trill call subtype predominated following both electrical and optical stimulation. Conclusion: The occurrence of electrically-evoked 50-kHz calls, time-locked to phasic DA (Experiment 1), provides correlational evidence supporting a role for phasic DA in USV production. However, in Experiment 2, the temporal dissociation between calling and optogenetic stimulation of midbrain DAergic neurons suggests that phasic mesolimbic DA release is not sufficient to produce 50-kHz calls. The emission of the trill subtype of 50-kHz calls potentially provides a marker distinguishing positive affect from positive reinforcement.
Collapse
Affiliation(s)
- Tina Scardochio
- Department of Pharmacology and Therapeutics, Neuropsychopharmacology, McGill University Montreal, QC, Canada
| | - Ivan Trujillo-Pisanty
- Department of Psychology, Center for Studies in Behavioral Neurobiology, Concordia University Montreal, QC, Canada
| | - Kent Conover
- Department of Psychology, Center for Studies in Behavioral Neurobiology, Concordia University Montreal, QC, Canada
| | - Peter Shizgal
- Department of Psychology, Center for Studies in Behavioral Neurobiology, Concordia University Montreal, QC, Canada
| | - Paul B S Clarke
- Department of Pharmacology and Therapeutics, Neuropsychopharmacology, McGill University Montreal, QC, Canada ; Department of Psychology, Center for Studies in Behavioral Neurobiology, Concordia University Montreal, QC, Canada
| |
Collapse
|
10
|
Cossette MP, Conover K, Shizgal P. The neural substrates for the rewarding and dopamine-releasing effects of medial forebrain bundle stimulation have partially discrepant frequency responses. Behav Brain Res 2015; 297:345-58. [PMID: 26477378 DOI: 10.1016/j.bbr.2015.10.029] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2015] [Revised: 10/09/2015] [Accepted: 10/10/2015] [Indexed: 10/22/2022]
Abstract
Midbrain dopamine neurons have long been implicated in the rewarding effect produced by electrical brain stimulation of the medial forebrain bundle (MFB). These neurons are excited trans-synaptically, but their precise role in intracranial self-stimulation (ICSS) has yet to be determined. This study assessed the hypothesis that midbrain dopamine neurons are in series with the directly stimulated substrate for self-stimulation of the MFB and either perform spatio-temporal integration of synaptic input from directly activated MFB fibers or relay the results of such integration to efferent stages of the reward circuitry. Psychometric current-frequency trade-off functions were derived from ICSS performance, and chemometric trade-off functions were derived from stimulation-induced dopamine transients in the nucleus accumbens (NAc) shell, measured by means of fast-scan cyclic voltammetry. Whereas the psychometric functions decline monotonically over a broad range of pulse frequencies and level off only at high frequencies, the chemometric functions obtained with the same rats and electrodes are either U-shaped or level off at lower pulse frequencies. This discrepancy was observed when the dopamine transients were recorded in either anesthetized or awake subjects. The lack of correspondence between the psychometric and chemometric functions is inconsistent with the hypothesis that dopamine neurons projecting to the NAc shell constitute an entire series stage of the neural circuit subserving self-stimulation of the MFB.
Collapse
Affiliation(s)
- M-P Cossette
- Center for Studies in Behavioral Neurobiology/Groupe de Recherche en Neurobiologie Comportementale, Concordia University, 7141 Sherbrooke Street West, SP-244, Montréal, Québec H4B 1R6, Canada.
| | - K Conover
- Center for Studies in Behavioral Neurobiology/Groupe de Recherche en Neurobiologie Comportementale, Concordia University, 7141 Sherbrooke Street West, SP-244, Montréal, Québec H4B 1R6, Canada.
| | - P Shizgal
- Center for Studies in Behavioral Neurobiology/Groupe de Recherche en Neurobiologie Comportementale, Concordia University, 7141 Sherbrooke Street West, SP-244, Montréal, Québec H4B 1R6, Canada.
| |
Collapse
|
11
|
Abstract
Rewards are crucial objects that induce learning, approach behavior, choices, and emotions. Whereas emotions are difficult to investigate in animals, the learning function is mediated by neuronal reward prediction error signals which implement basic constructs of reinforcement learning theory. These signals are found in dopamine neurons, which emit a global reward signal to striatum and frontal cortex, and in specific neurons in striatum, amygdala, and frontal cortex projecting to select neuronal populations. The approach and choice functions involve subjective value, which is objectively assessed by behavioral choices eliciting internal, subjective reward preferences. Utility is the formal mathematical characterization of subjective value and a prime decision variable in economic choice theory. It is coded as utility prediction error by phasic dopamine responses. Utility can incorporate various influences, including risk, delay, effort, and social interaction. Appropriate for formal decision mechanisms, rewards are coded as object value, action value, difference value, and chosen value by specific neurons. Although all reward, reinforcement, and decision variables are theoretical constructs, their neuronal signals constitute measurable physical implementations and as such confirm the validity of these concepts. The neuronal reward signals provide guidance for behavior while constraining the free will to act.
Collapse
Affiliation(s)
- Wolfram Schultz
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge, United Kingdom
| |
Collapse
|
12
|
Psychophysical inference of frequency-following fidelity in the neural substrate for brain stimulation reward. Behav Brain Res 2015; 292:327-41. [PMID: 26057357 DOI: 10.1016/j.bbr.2015.06.008] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2015] [Revised: 06/01/2015] [Accepted: 06/03/2015] [Indexed: 01/16/2023]
Abstract
The rewarding effect of electrical brain stimulation has been studied extensively for 60 years, yet the identity of the underlying neural circuitry remains unknown. Previous experiments have characterized the directly stimulated ("first-stage") neurons implicated in self-stimulation of the medial forebrain bundle. Their properties are consistent with those of fine, myelinated axons, at least some of which project rostro-caudally. These properties do not match those of dopaminergic neurons. The present psychophysical experiment estimates an additional first-stage characteristic: maximum firing frequency. We test a frequency-following model that maps the experimenter-set pulse frequency into the frequency of firing induced in the directly stimulated neurons. As pulse frequency is increased, firing frequency initially increases at the same rate, then becomes probabilistic, and finally levels off. The frequency-following function is based on the counter model which holds that the rewarding effect of a pulse train is determined by the aggregate spike rate triggered in first-stage neurons during a given interval. In 7 self-stimulating rats, we measured current- vs. pulse-frequency trade-off functions. The trade-off data were well described by the frequency-following model, and its upper asymptote was approached at a median value of 360 Hz (IQR = 46 Hz). This value implies a highly excitable, non-dopaminergic population of first-stage neurons. Incorporating the frequency-following function and parameters in Shizgal's 3-dimensional reward-mountain model improves its accuracy and predictive power.
Collapse
|
13
|
Breton YA, Seeland KD, Redish AD. Aging impairs deliberation and behavioral flexibility in inter-temporal choice. Front Aging Neurosci 2015; 7:41. [PMID: 25870560 PMCID: PMC4375985 DOI: 10.3389/fnagi.2015.00041] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2014] [Accepted: 03/10/2015] [Indexed: 11/13/2022] Open
Abstract
Inter-temporal choice depends on multiple, interacting systems, some of which may be compromised with age. Some of these systems may be responsible for ongoing trial-by-trial choice strategies. Some may represent the consequences of action. Some may be necessary for the coupling between anticipated consequences and strategies currently in use, flexibly guiding behavior. When faced with a difficult decision, rats will orient back and forth, a behavior termed "vicarious trial and error" (VTE). Recent experiments have linked the occurrence of VTE to hippocampal search processes and behavioral flexibility. We tested 5 month (n = 6), 9 month (n = 8) and over-27 month-old (n = 10) rats on a Spatial Adjusting Delay Discounting task to examine how aging impacted lap-by-lap strategies and VTE during inter-temporal choice. Rats chose between spatially separated food goals that provided a smaller-sooner or larger-later reward. On each lap, the delay to the larger-later reward was adjusted as a function of the rat's decisions, increasing by 1 s after delayed-side choices and decreasing by 1 s after non-delayed side choices. The strategies that aged rats used differed from those used in young and adult rats. Moreover, aged rats produced reliably more VTE behaviors, for protracted periods of time, uncoupled from behavioral flexibility.
Collapse
Affiliation(s)
| | - Kelsey D Seeland
- Department of Neuroscience, University of Minnesota Minneapolis, MN, USA
| | - A David Redish
- Department of Neuroscience, University of Minnesota Minneapolis, MN, USA
| |
Collapse
|
14
|
Niyogi RK, Shizgal P, Dayan P. Some work and some play: microscopic and macroscopic approaches to labor and leisure. PLoS Comput Biol 2014; 10:e1003894. [PMID: 25474151 PMCID: PMC4256012 DOI: 10.1371/journal.pcbi.1003894] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2014] [Accepted: 09/07/2014] [Indexed: 11/17/2022] Open
Abstract
Given the option, humans and other animals elect to distribute their time between work and leisure, rather than choosing all of one and none of the other. Traditional accounts of partial allocation have characterised behavior on a macroscopic timescale, reporting and studying the mean times spent in work or leisure. However, averaging over the more microscopic processes that govern choices is known to pose tricky theoretical problems, and also eschews any possibility of direct contact with the neural computations involved. We develop a microscopic framework, formalized as a semi-Markov decision process with possibly stochastic choices, in which subjects approximately maximise their expected returns by making momentary commitments to one or other activity. We show macroscopic utilities that arise from microscopic ones, and demonstrate how facets such as imperfect substitutability can arise in a more straightforward microscopic manner. Dividing limited time between work and leisure when both are attractive is a common everyday decision. Rather than doing one exclusively, humans and other animals distribute their time between both. Traditional explanations of this phenomenon have studied the macroscopic average times spent in both. By contrast, we develop a microscopic framework in which we can model the real-time decisions that underpin these averages. In the framework, subjects' choices are approximately optimal, according to a natural, microscopic, utility function. We show that the assumptions of previous theories are not necessary for partial allocation to be optimal, and show possibilities and limits to the integration of macroscopic and microscopic views. Our approach opens new vistas onto the real-time processes underlying cost-benefit decision-making.
Collapse
Affiliation(s)
- Ritwik K Niyogi
- Gatsby Computational Neuroscience Unit, University College London, London, United Kingdom
| | - Peter Shizgal
- Center for Studies in Behavioral Neurobiology, Concordia University, Montreal, Quebec, Canada
| | - Peter Dayan
- Gatsby Computational Neuroscience Unit, University College London, London, United Kingdom
| |
Collapse
|
15
|
Breton YA, Conover K, Shizgal P. The effect of probability discounting on reward seeking: a three-dimensional perspective. Front Behav Neurosci 2014; 8:284. [PMID: 25202245 PMCID: PMC4142602 DOI: 10.3389/fnbeh.2014.00284] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2014] [Accepted: 08/02/2014] [Indexed: 11/13/2022] Open
Abstract
Rats will work for electrical stimulation of the medial forebrain bundle. The rewarding effect arises from the volleys of action potentials fired by the stimulation and subsequent spatio-temporal integration of their post-synpatic impact. The proportion of time allocated to self-stimulation depends on the intensity of the rewarding effect as well as on other key determinants of decision-making, such as subjective opportunity costs and reward probability. We have proposed that a 3D model relating time allocation to the intensity and cost of reward can distinguish manipulations acting prior to the output of the spatio-temporal integrator from those acting at or beyond it. Here, we test this proposition by varying reward probability, a variable that influences the computation of payoff in the 3D model downstream from the output of the integrator. On riskless trials, reward was delivered on every occasion that the rat held down the lever for a cumulative duration called the “price,” whereas on risky trials, reward was delivered with probability 0.75 or 0.50. According to the model, the 3D structure relating time allocation to reward intensity and price is shifted leftward along the price axis by reductions in reward probability; the magnitude of the shift estimates the change in subjective probability. The predictions were borne out: reducing reward probability shifted the 3D structure systematically along the price axis while producing only small, inconsistent displacements along the pulse-frequency axis. The results confirm that the model can accurately distinguish manipulations acting at or beyond the spatio-temporal integrator and strengthen the conclusions of previous studies showing similar shifts following dopaminergic manipulations. Subjective and objective reward probabilities appeared indistinguishable over the range of 0.5 ≤ p ≤ 1.0.
Collapse
Affiliation(s)
- Yannick-André Breton
- Department of Neuroscience, University of Minnesota, Twin Cities Minneapolis, MN, USA
| | - Kent Conover
- Groupe de Recherche en Neurobiologie Comportementale, Department of Psychology, Center for Studies in Behavioural Neurobiology, Concordia University Montreal, QC, Canada
| | - Peter Shizgal
- Groupe de Recherche en Neurobiologie Comportementale, Department of Psychology, Center for Studies in Behavioural Neurobiology, Concordia University Montreal, QC, Canada
| |
Collapse
|
16
|
Niyogi RK, Breton YA, Solomon RB, Conover K, Shizgal P, Dayan P. Optimal indolence: a normative microscopic approach to work and leisure. J R Soc Interface 2014; 11:20130969. [PMID: 24284898 PMCID: PMC3869171 DOI: 10.1098/rsif.2013.0969] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2013] [Accepted: 11/07/2013] [Indexed: 11/12/2022] Open
Abstract
Dividing limited time between work and leisure when both have their attractions is a common everyday decision. We provide a normative control-theoretic treatment of this decision that bridges economic and psychological accounts. We show how our framework applies to free-operant behavioural experiments in which subjects are required to work (depressing a lever) for sufficient total time (called the price) to receive a reward. When the microscopic benefit-of-leisure increases nonlinearly with duration, the model generates behaviour that qualitatively matches various microfeatures of subjects' choices, including the distribution of leisure bout durations as a function of the pay-off. We relate our model to traditional accounts by deriving macroscopic, molar, quantities from microscopic choices.
Collapse
Affiliation(s)
- Ritwik K. Niyogi
- Gatsby Computational Neuroscience Unit, University College London, London, UK
| | - Yannick-Andre Breton
- Center for Studies in Behavioral Neurobiology, Concordia University, Montreal, Quebec, Canada
| | - Rebecca B. Solomon
- Center for Studies in Behavioral Neurobiology, Concordia University, Montreal, Quebec, Canada
| | - Kent Conover
- Center for Studies in Behavioral Neurobiology, Concordia University, Montreal, Quebec, Canada
| | - Peter Shizgal
- Center for Studies in Behavioral Neurobiology, Concordia University, Montreal, Quebec, Canada
| | - Peter Dayan
- Gatsby Computational Neuroscience Unit, University College London, London, UK
| |
Collapse
|
17
|
Trujillo-Pisanty I, Conover K, Shizgal P. A new view of the effect of dopamine receptor antagonism on operant performance for rewarding brain stimulation in the rat. Psychopharmacology (Berl) 2013; 231:1351-1364. [PMID: 24232443 DOI: 10.1007/s00213-013-3328-x] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/16/2013] [Accepted: 10/08/2013] [Indexed: 11/30/2022]
Abstract
RATIONALE Previous studies of neuroleptic challenges to intracranial self-stimulation (ICSS) employed two-dimensional (2D) measurements (curve shifts). Results so obtained are ambiguous with regard to the stage of neural processing at which the drug produces its performance-altering effect. We substituted a three-dimensional (3D) method that measures reward-seeking as a function of both the strength and cost of reward. This method reveals whether changes in reward seeking are due to drug action prior to the output of the circuitry that performs spatiotemporal integration of the stimulation-induced neural activity. OBJECTIVES The aim of this study was to obtain new information about the stage of neural processing at which pimozide acts to alter pursuit of brain stimulation reward (BSR). METHODS Following treatment with pimozide (0.1 mg/kg) or its vehicle, the proportion of trial time allocated to working for BSR was measured as a function of pulse frequency and opportunity cost. A surface defined by Shizgal's reward-mountain model was fitted to the drug and vehicle data. RESULTS Pimozide lowered the cost required to decrease performance for a maximal BSR to half its maximal level but did not alter the pulse-frequency required to produce a reward of half-maximal intensity. CONCLUSIONS Like indirect dopamine agonists, pimozide does not alter the sensitivity of brain reward circuity but changes reward-system gain, subjective effort costs, and/or the value of activities that compete with ICSS. The 3D method is more sensitive and informative than the 2D methods employed previously.
Collapse
Affiliation(s)
- I Trujillo-Pisanty
- Center for Studies in Behavioural Neurobiology/Groupe de Recherche en Neurobiologie Comportementale, Concordia University, 7141 Sherbrooke Street West, SP-244, Montreal, QC, H4B 1R6, Canada
| | | | | |
Collapse
|
18
|
Breton YA, Mullett A, Conover K, Shizgal P. Validation and extension of the reward-mountain model. Front Behav Neurosci 2013; 7:125. [PMID: 24098275 PMCID: PMC3787655 DOI: 10.3389/fnbeh.2013.00125] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2013] [Accepted: 09/04/2013] [Indexed: 11/13/2022] Open
Abstract
The reward-mountain model relates the vigor of reward seeking to the strength and cost of reward. Application of this model provides information about the stage of processing at which manipulations such as drug administration, lesions, deprivation states, and optogenetic interventions act to alter reward seeking. The model has been updated by incorporation of new information about frequency following in the directly stimulated neurons responsible for brain stimulation reward and about the function that maps objective opportunity costs into subjective ones. The behavioral methods for applying the model have been updated and improved as well. To assess the impact of these changes, two related predictions of the model that were supported by earlier work have been retested: (1) altering the duration of rewarding brain stimulation should change the pulse frequency required to produce a reward of half-maximal intensity, and (2) this manipulation should not change the opportunity cost at which half-maximal performance is directed at earning a maximally intense reward. Prediction 1 was supported in all six subjects, but prediction 2 was supported in only three. The latter finding is interpreted to reflect recruitment, at some stimulation sites, of a heterogeneous reward substrate comprising dual, parallel circuits that integrate the stimulation-induced neural signals.
Collapse
Affiliation(s)
- Yannick-André Breton
- Department of Psychology, Groupe de Recherche en Neurobiologie Comportementale, Center for Studies in Behavioural Neurobiology, Concordia University Montréal, QC, Canada
| | | | | | | |
Collapse
|
19
|
Abstract
Recent notions about the vigour of responding in operant conditioning suggest that the long-run average rate of reward should control the alacrity of action in cases in which the actual cost of speed is balanced against the opportunity cost of sloth. The average reward rate is suggested as being reported by tonic activity in the dopamine system and thereby influencing all actions, including ones that do not themselves lead directly to the rewards. This idea is syntactically problematical for the case of punishment. Here, we broaden the scope of the original suggestion, providing a two-factor analysis of obviated punishment in a variety of operant circumstances. We also consider the effects of stochastically successful actions, which turn out to differ rather markedly between appetitive and aversive cases. Finally, we study how to fit these ideas into nascent treatments that extend concepts of opponency between dopamine and serotonin from valence to invigoration.
Collapse
Affiliation(s)
- Peter Dayan
- Gatsby Computational Neuroscience Unit, UCL, 17 Queen Square, London WC1N 3AR, UK.
| |
Collapse
|
20
|
Abstract
Dopaminergic neurons contribute to intracranial self-stimulation (ICSS) and other reward-seeking behaviors, but it is not yet known where dopaminergic neurons intervene in the neural circuitry underlying reward pursuit or which psychological processes are involved. In rats working for electrical stimulation of the medial forebrain bundle, we assessed the effect of GBR-12909 (1-[2-[bis(4-fluorophenyl)-methoxy]ethyl]-4-[3- phenylpropyl]piperazine), a specific blocker of the dopamine transporter. Operant performance was measured as a function of the strength and cost of electrical stimulation. GBR-12909 increased the opportunity cost most subjects were willing to pay for a reward of a given intensity. However, this effect was smaller than that produced by a regimen of cocaine administration that drove similar increases in nucleus accumbens (NAc) dopamine levels in unstimulated rats. Delivery of rewarding stimulation to drug-treated rats caused an additional increase in dopamine concentration in the NAc shell in cocaine-treated, but not GBR-12909-treated, rats. These behavioral and neurochemical differences may reflect blockade of the norepinephrine transporter by cocaine but not by GBR-12909. Whereas the effect of psychomotor stimulants on ICSS has long been attributed to dopaminergic action at early stages of the reward pathway, the results reported here imply that increased dopamine tone boosts reward pursuit by acting at or beyond the output of the circuitry that temporally and spatially summates the output of the directly stimulated neurons underlying ICSS. The observed enhancement of reward seeking could be attributable to a decrease in the value of competing behaviors, a decrease in subjective effort costs, or an increase in reward-system gain.
Collapse
|
21
|
Shizgal P. Scarce means with alternative uses: robbins' definition of economics and its extension to the behavioral and neurobiological study of animal decision making. Front Neurosci 2012; 6:20. [PMID: 22363253 PMCID: PMC3275781 DOI: 10.3389/fnins.2012.00020] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2011] [Accepted: 01/23/2012] [Indexed: 11/13/2022] Open
Abstract
Almost 80 years ago, Lionel Robbins proposed a highly influential definition of the subject matter of economics: the allocation of scarce means that have alternative ends. Robbins confined his definition to human behavior, and he strove to separate economics from the natural sciences in general and from psychology in particular. Nonetheless, I extend his definition to the behavior of non-human animals, rooting my account in psychological processes and their neural underpinnings. Some historical developments are reviewed that render such a view more plausible today than would have been the case in Robbins’ time. To illustrate a neuroeconomic perspective on decision making in non-human animals, I discuss research on the rewarding effect of electrical brain stimulation. Central to this discussion is an empirically based, functional/computational model of how the subjective intensity of the electrical reward is computed and combined with subjective costs so as to determine the allocation of time to the pursuit of reward. Some successes achieved by applying the model are discussed, along with limitations, and evidence is presented regarding the roles played by several different neural populations in processes posited by the model. I present a rationale for marshaling convergent experimental methods to ground psychological and computational processes in the activity of identified neural populations, and I discuss the strengths, weaknesses, and complementarity of the individual approaches. I then sketch some recent developments that hold great promise for advancing our understanding of structure–function relationships in neuroscience in general and in the neuroeconomic study of decision making in particular.
Collapse
Affiliation(s)
- Peter Shizgal
- Department of Psychology, Center for Studies in Behavioral Neurobiology, Concordia University Montréal, QC, Canada
| |
Collapse
|
22
|
Cannabinoid receptor blockade reduces the opportunity cost at which rats maintain operant performance for rewarding brain stimulation. J Neurosci 2011; 31:5426-35. [PMID: 21471378 DOI: 10.1523/jneurosci.0079-11.2011] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
There is ample evidence that blockade of CB(1) receptors reduces reward seeking. However, the reported effects of CB(1) blockade on performance for rewarding electrical brain stimulation stand out as an exception. By applying a novel method for conceptualizing and measuring reward seeking, we show that AM-251, a CB(1) receptor antagonist, does indeed decrease performance for rewarding electrical stimulation of the medial forebrain bundle in rats. Reward seeking depends on multiple sets of variables, including the intensity of the reward, its cost, and the value of competing rewards. In turn, reward intensity depends both on the sensitivity and gain of brain reward circuitry. We show that drug-induced changes in sensitivity cannot account for the suppressive effect of AM-251 on reward seeking. Therefore, the role of CB(1) receptors must be sought among the remaining determinants of performance. Our analysis provides an explanation of the inconsistencies between prior reports, which likely arose from the following: (1) the averaging of data across subjects showing heterogeneous effects and (2) the use of methods that cannot distinguish between the different determinants of reward pursuit. By means of microdialysis, we demonstrate that blockade of CB(1) receptors attenuates nucleus accumbens dopamine release in response to rewarding medial forebrain bundle stimulation, and we propose that this action is responsible for the ability of the drug to decrease performance for the electrical reward.
Collapse
|
23
|
Hernandez G, Breton YA, Conover K, Shizgal P. At what stage of neural processing does cocaine act to boost pursuit of rewards? PLoS One 2010; 5:e15081. [PMID: 21152097 PMCID: PMC2994896 DOI: 10.1371/journal.pone.0015081] [Citation(s) in RCA: 49] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2010] [Accepted: 10/15/2010] [Indexed: 11/25/2022] Open
Abstract
Dopamine-containing neurons have been implicated in reward and decision making. One element of the supporting evidence is that cocaine, like other drugs that increase dopaminergic neurotransmission, powerfully potentiates reward seeking. We analyze this phenomenon from a novel perspective, introducing a new conceptual framework and new methodology for determining the stage(s) of neural processing at which drugs, lesions and physiological manipulations act to influence reward-seeking behavior. Cocaine strongly boosts the proclivity of rats to work for rewarding electrical brain stimulation. We show that the conventional conceptual framework and methods do not distinguish between three conflicting accounts of how the drug produces this effect: increased sensitivity of brain reward circuitry, increased gain, or decreased subjective reward costs. Sensitivity determines the stimulation strength required to produce a reward of a given intensity (a measure analogous to the KM of an enzyme) whereas gain determines the maximum intensity attainable (a measure analogous to the vmax of an enzyme-catalyzed reaction). To distinguish sensitivity changes from the other determinants, we measured and modeled reward seeking as a function of both stimulation strength and opportunity cost. The principal effect of cocaine was a two-fourfold increase in willingness to pay for the electrical reward, an effect consistent with increased gain or decreased subjective cost. This finding challenges the long-standing view that cocaine increases the sensitivity of brain reward circuitry. We discuss the implications of the results and the analytic approach for theories of how dopaminergic neurons and other diffuse modulatory brain systems contribute to reward pursuit, and we explore the implications of the conceptual framework for the study of natural rewards, drug reward, and mood.
Collapse
Affiliation(s)
- Giovanni Hernandez
- Department of Anatomy and Neurobiology, University of Maryland School of Medicine, Baltimore, Maryland, United States of America
| | - Yannick-André Breton
- Center for Studies in Behavioral Neurobiology, Concordia University, Montréal, Québec, Canada
| | - Kent Conover
- Center for Studies in Behavioral Neurobiology, Concordia University, Montréal, Québec, Canada
| | - Peter Shizgal
- Center for Studies in Behavioral Neurobiology, Concordia University, Montréal, Québec, Canada
- * E-mail:
| |
Collapse
|