101
|
Dagher A. Functional brain imaging of appetite. Trends Endocrinol Metab 2012; 23:250-60. [PMID: 22483361 DOI: 10.1016/j.tem.2012.02.009] [Citation(s) in RCA: 110] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/25/2011] [Revised: 02/22/2012] [Accepted: 02/28/2012] [Indexed: 01/01/2023]
Abstract
Obesity is a neurobehavioral disorder that results from a combination of overeating and insufficient physical activity. Finely tuned mechanisms exist to match food intake to caloric expenditure. However, faced with abundant inexpensive and calorie-dense foods, many humans (and perhaps most) have a tendency to consume beyond their caloric needs. The brain controls food intake by sensing internal energy-balance signals and external cues of food availability, and by controlling feeding behavior; it is therefore at the centre of the obesity problem. This article reviews the recent use of functional brain imaging in humans to study the neural control of appetite, and how the neural systems involved may cause vulnerability to overeating in the obesogenic environment.
Collapse
Affiliation(s)
- Alain Dagher
- Montreal Neurological Institute, McGill University, 3801 University Street, Montreal, Quebec, H3A 2B4, Canada.
| |
Collapse
|
102
|
Abstract
In two experiments, effects of incentives on task switching were investigated. Incentives were provided as a monetary bonus. In both experiments, the availability of a bonus varied on a trial-to-trial basis. The main difference between the experiments relates to the association of incentives to individual tasks. In Experiment 1, the association of incentives to individual tasks was fixed. Under these conditions, the effect of incentives was largely due to reward expectancy. Switch costs were reduced to statistical insignificance. This was true even with the task that was not associated with a bonus. In Experiment 2, there was a variable association of incentives to individual tasks. Under these conditions, the reward expectancy effect was bound to conditions with a well-established bonus-task association. In conditions in which the bonus-task association was not established in advance, enhanced performance of the bonus task was accompanied by performance decrements with the task that was not associated with a bonus. Reward expectancy affected mainly the general level of performance. The outcome of this study may also inform recently suggested neurobiological accounts about the temporal dynamics of reward processing.
Collapse
Affiliation(s)
- Thomas Kleinsorge
- IfADo – Leibniz Research Centre for Working Environment and Human Factors, Dortmund, Germany
| | - Gerhard Rinkenauer
- IfADo – Leibniz Research Centre for Working Environment and Human Factors, Dortmund, Germany
| |
Collapse
|
103
|
Gold JM, Waltz JA, Matveeva TM, Kasanova Z, Strauss GP, Herbener ES, Collins AGE, Frank MJ. Negative symptoms and the failure to represent the expected reward value of actions: behavioral and computational modeling evidence. ACTA ACUST UNITED AC 2012; 69:129-38. [PMID: 22310503 DOI: 10.1001/archgenpsychiatry.2011.1269] [Citation(s) in RCA: 236] [Impact Index Per Article: 18.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Abstract
CONTEXT Negative symptoms are a core feature of schizophrenia, but their pathogenesis remains unclear. Negative symptoms are defined by the absence of normal function. However, there must be a productive mechanism that leads to this absence. OBJECTIVE To test a reinforcement learning account suggesting that negative symptoms result from a failure in the representation of the expected value of rewards coupled with preserved loss-avoidance learning. DESIGN Participants performed a probabilistic reinforcement learning paradigm involving stimulus pairs in which choices resulted in reward or in loss avoidance. Following training, participants indicated their valuation of the stimuli in a transfer test phase. Computational modeling was used to distinguish between alternative accounts of the data. SETTING A tertiary care research outpatient clinic. PATIENTS In total, 47 clinically stable patients with a diagnosis of schizophrenia or schizoaffective disorder and 28 healthy volunteers participated in the study. Patients were divided into a high-negative symptom group and a low-negative symptom group. MAIN OUTCOME MEASURES The number of choices leading to reward or loss avoidance, as well as performance in the transfer test phase. Quantitative fits from 3 different models were examined. RESULTS Patients in the high-negative symptom group demonstrated impaired learning from rewards but intact loss-avoidance learning and failed to distinguish rewarding stimuli from loss-avoiding stimuli in the transfer test phase. Model fits revealed that patients in the high-negative symptom group were better characterized by an "actor-critic" model, learning stimulus-response associations, whereas control subjects and patients in the low-negative symptom group incorporated expected value of their actions ("Q learning") into the selection process. CONCLUSIONS Negative symptoms in schizophrenia are associated with a specific reinforcement learning abnormality: patients with high-negative symptoms do not represent the expected value of rewards when making decisions but learn to avoid punishments through the use of prediction errors. This computational framework offers the potential to understand negative symptoms at a mechanistic level.
Collapse
Affiliation(s)
- James M Gold
- Department of Psychiatry, Maryland Psychiatric Research Center, University of Maryland School of Medicine, Baltimore, MD 21228, USA.
| | | | | | | | | | | | | | | |
Collapse
|
104
|
Bowirrat A, Chen TJH, Oscar-Berman M, Madigan M, Chen AL, Bailey JA, Braverman ER, Kerner M, Giordano J, Morse S, Downs BW, Waite RL, Fornari F, Armaly Z, Blum K. Neuropsychopharmacology and neurogenetic aspects of executive functioning: should reward gene polymorphisms constitute a diagnostic tool to identify individuals at risk for impaired judgment? Mol Neurobiol 2012; 45:298-313. [PMID: 22371275 DOI: 10.1007/s12035-012-8247-z] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2011] [Accepted: 02/08/2012] [Indexed: 12/25/2022]
Abstract
Executive functions are processes that act in harmony to control behaviors necessary for maintaining focus and achieving outcomes. Executive dysfunction in neuropsychiatric disorders is attributed to structural or functional pathology of brain networks involving prefrontal cortex (PFC) and its connections with other brain regions. The PFC receives innervations from different neurons associated with a number of neurotransmitters, especially dopamine (DA). Here we review findings on the contribution of PFC DA to higher-order cognitive and emotional behaviors. We suggest that examination of multifactorial interactions of an individual's genetic history, along with environmental risk factors, can assist in the characterization of executive functioning for that individual. Based upon the results of genetic studies, we also propose genetic mapping as a probable diagnostic tool serving as a therapeutic adjunct for augmenting executive functioning capabilities. We conclude that preservation of the neurological underpinnings of executive functions requires the integrity of complex neural systems including the influence of specific genes and associated polymorphisms to provide adequate neurotransmission.
Collapse
Affiliation(s)
- Abdalla Bowirrat
- Clinical Neuroscience and Population Genetics, The Nazareth English Hospital (EMME), 16100 Nazareth, Israel.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
105
|
Holroyd CB, Yeung N. Motivation of extended behaviors by anterior cingulate cortex. Trends Cogn Sci 2012; 16:122-8. [PMID: 22226543 DOI: 10.1016/j.tics.2011.12.008] [Citation(s) in RCA: 446] [Impact Index Per Article: 34.3] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2011] [Revised: 11/25/2011] [Accepted: 12/06/2011] [Indexed: 11/22/2022]
Affiliation(s)
- Clay B Holroyd
- Department of Psychology, University of Victoria, P.O. Box 3050 Victoria, BC V8W 3P5, Canada.
| | | |
Collapse
|
106
|
Affiliation(s)
- Alain Dagher
- Montreal Neurological Institute, Quebec, Canada.
| |
Collapse
|
107
|
Klein-Flügge MC, Hunt LT, Bach DR, Dolan RJ, Behrens TEJ. Dissociable reward and timing signals in human midbrain and ventral striatum. Neuron 2012; 72:654-64. [PMID: 22099466 PMCID: PMC3219831 DOI: 10.1016/j.neuron.2011.08.024] [Citation(s) in RCA: 66] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/29/2011] [Indexed: 11/30/2022]
Abstract
Reward prediction error (RPE) signals are central to current models of reward-learning. Temporal difference (TD) learning models posit that these signals should be modulated by predictions, not only of magnitude but also timing of reward. Here we show that BOLD activity in the VTA conforms to such TD predictions: responses to unexpected rewards are modulated by a temporal hazard function and activity between a predictive stimulus and reward is depressed in proportion to predicted reward. By contrast, BOLD activity in ventral striatum (VS) does not reflect a TD RPE, but instead encodes a signal on the variable relevant for behavior, here timing but not magnitude of reward. The results have important implications for dopaminergic models of cortico-striatal learning and suggest a modification of the conventional view that VS BOLD necessarily reflects inputs from dopaminergic VTA neurons signaling an RPE.
Collapse
Affiliation(s)
- Miriam C Klein-Flügge
- Sobell Department of Motor Neuroscience and Movement Disorders, Institute of Neurology, UCL, London WC1N3BG, UK.
| | | | | | | | | |
Collapse
|
108
|
Double dissociation of stimulus-value and action-value learning in humans with orbitofrontal or anterior cingulate cortex damage. J Neurosci 2011; 31:15048-52. [PMID: 22016538 DOI: 10.1523/jneurosci.3164-11.2011] [Citation(s) in RCA: 132] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
Adaptive decision making involves selecting the most valuable option, typically by taking an action. Such choices require value comparisons, but there is debate about whether these comparisons occur at the level of stimuli (goods-based) value, action-based value, or both. One view is that value processes occur in series, with stimulus value informing action value. However, lesion work in nonhuman primates suggests that these two kinds of choice are dissociable. Here, we examined action-value and stimulus-value learning in humans with focal frontal lobe damage. Orbitofrontal damage disrupted the ability to sustain the correct choice of stimulus, but not of action, after positive feedback, while damage centered on dorsal anterior cingulate cortex led to the opposite deficit. These findings argue that there are distinct, domain-specific mechanisms by which outcome value is applied to guide subsequent decisions, depending on whether the choice is between stimuli or between actions.
Collapse
|
109
|
Berthet P, Lansner A. An abstract model of the basal ganglia, reward learning and action selection. BMC Neurosci 2011. [PMCID: PMC3240288 DOI: 10.1186/1471-2202-12-s1-p189] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
|
110
|
Morris SE, Holroyd CB, Mann-Wrobel MC, Gold JM. Dissociation of response and feedback negativity in schizophrenia: electrophysiological and computational evidence for a deficit in the representation of value. Front Hum Neurosci 2011; 5:123. [PMID: 22065618 PMCID: PMC3203413 DOI: 10.3389/fnhum.2011.00123] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2011] [Accepted: 10/10/2011] [Indexed: 11/20/2022] Open
Abstract
Contrasting theories of schizophrenia propose that the disorder is characterized by a deficit in phasic changes in dopamine activity in response to ongoing events or, alternatively, by a weakness in the representation of the value of responses. Schizophrenia patients have reliably reduced brain activity following incorrect responses but other research suggests that they may have intact feedback-related potentials, indicating that the impairment may be specifically response-related. We used event-related brain potentials and computational modeling to examine this issue by comparing the neural response to outcomes with the neural response to behaviors that predict outcomes in patients with schizophrenia and psychiatrically healthy comparison subjects. We recorded feedback-related activity in a passive gambling task and a time estimation task and error-related activity in a flanker task. Patients' brain activity following an erroneous response was reduced compared to comparison subjects but feedback-related activity did not differ between groups. To test hypotheses about the possible causes of this pattern of results, we used computational modeling of the electrophysiological data to simulate the effects of an overall reduction in patients' sensitivity to feedback, selective insensitivity to positive or negative feedback, reduced learning rate, and a decreased representation of the value of the response given the stimulus on each trial. The results of the computational modeling suggest that schizophrenia patients exhibit weakened representation of response values, possibly due to failure of the basal ganglia to strongly associate stimuli with appropriate response alternatives.
Collapse
Affiliation(s)
- Sarah E. Morris
- VISN 5 Mental Illness Research, Education, and Clinical CenterBaltimore, MD, USA
- Department of Psychiatry, University of Maryland School of MedicineBaltimore, MD, USA
| | - Clay B. Holroyd
- Department of Psychology, University of VictoriaVictoria, BC, Canada
| | | | - James M. Gold
- Maryland Psychiatric Research Center, University of Maryland School of MedicineCatonsville, MD, USA
| |
Collapse
|
111
|
Abstract
The basal ganglia are a chain of subcortical nuclei that facilitate action selection. Two striatal projection systems--so-called direct and indirect pathways--form the functional backbone of the basal ganglia circuit. Twenty years ago, investigators proposed that the striatum's ability to use dopamine (DA) rise and fall to control action selection was due to the segregation of D(1) and D(2) DA receptors in direct- and indirect-pathway spiny projection neurons. Although this hypothesis sparked a debate, the evidence that has accumulated since then clearly supports this model. Recent advances in the means of marking neural circuits with optical or molecular reporters have revealed a clear-cut dichotomy between these two cell types at the molecular, anatomical, and physiological levels. The contrast provided by these studies has provided new insights into how the striatum responds to fluctuations in DA signaling and how diseases that alter this signaling change striatal function.
Collapse
Affiliation(s)
- Charles R Gerfen
- Laboratory of Systems Neuroscience, National Institute of Mental Health, Bethesda, Maryland 20892, USA.
| | | |
Collapse
|
112
|
Shiflett MW, Balleine BW. Molecular substrates of action control in cortico-striatal circuits. Prog Neurobiol 2011; 95:1-13. [PMID: 21704115 PMCID: PMC3175490 DOI: 10.1016/j.pneurobio.2011.05.007] [Citation(s) in RCA: 90] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2010] [Revised: 05/24/2011] [Accepted: 05/27/2011] [Indexed: 11/24/2022]
Abstract
The purpose of this review is to describe the molecular mechanisms in the striatum that mediate reward-based learning and action control during instrumental conditioning. Experiments assessing the neural bases of instrumental conditioning have uncovered functional circuits in the striatum, including dorsal and ventral striatal sub-regions, involved in action-outcome learning, stimulus-response learning, and the motivational control of action by reward-associated cues. Integration of dopamine (DA) and glutamate neurotransmission within these striatal sub-regions is hypothesized to enable learning and action control through its role in shaping synaptic plasticity and cellular excitability. The extracellular signal regulated kinase (ERK) appears to be particularly important for reward-based learning and action control due to its sensitivity to combined DA and glutamate receptor activation and its involvement in a range of cellular functions. ERK activation in striatal neurons is proposed to have a dual role in both the learning and performance factors that contribute to instrumental conditioning through its regulation of plasticity-related transcription factors and its modulation of intrinsic cellular excitability. Furthermore, perturbation of ERK activation by drugs of abuse may give rise to behavioral disorders such as addiction.
Collapse
|
113
|
Seger CA, Spiering BJ. A critical review of habit learning and the Basal Ganglia. Front Syst Neurosci 2011; 5:66. [PMID: 21909324 PMCID: PMC3163829 DOI: 10.3389/fnsys.2011.00066] [Citation(s) in RCA: 93] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2011] [Accepted: 08/01/2011] [Indexed: 11/23/2022] Open
Abstract
The current paper briefly outlines the historical development of the concept of habit learning and discusses its relationship to the basal ganglia. Habit learning has been studied in many different fields of neuroscience using different species, tasks, and methodologies, and as a result it has taken on a wide range of definitions from these various perspectives. We identify five common but not universal, definitional features of habit learning: that it is inflexible, slow or incremental, unconscious, automatic, and insensitive to reinforcer devaluation. We critically evaluate for each of these how it has been defined, its utility for research in both humans and non-human animals, and the evidence that it serves as an accurate description of basal ganglia function. In conclusion, we propose a multi-faceted approach to habit learning and its relationship to the basal ganglia, emphasizing the need for formal definitions that will provide directions for future research.
Collapse
Affiliation(s)
- Carol A Seger
- Department of Psychology, Colorado State University Fort Collins, CO, USA
| | | |
Collapse
|
114
|
Baker TE, Stockwell T, Barnes G, Holroyd CB. Individual differences in substance dependence: at the intersection of brain, behaviour and cognition. Addict Biol 2011; 16:458-66. [PMID: 20731633 DOI: 10.1111/j.1369-1600.2010.00243.x] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Recent theories of drug dependence propose that the transition from occasional recreational substance use to harmful use and dependence results from the impact of disrupted midbrain dopamine signals for reinforcement learning on frontal brain areas that implement cognitive control and decision-making. We investigated this hypothesis in humans using electrophysiological and behavioral measures believed to assay the integrity of midbrain dopamine system and its neural targets. Our investigation revealed two groups of dependent individuals, one characterized by disrupted dopamine-dependent reward learning and the other by disrupted error learning associated with depression-proneness. These results highlight important neurobiological and behavioral differences between two classes of dependent users that can inform the development of individually tailored treatment programs.
Collapse
Affiliation(s)
- Travis E Baker
- Department of Psychology, Center of Addiction Research of British Columbia, Child and Youth Care, University of Victoria, Victoria, BC, Canada.
| | | | | | | |
Collapse
|
115
|
Badre D, Frank MJ. Mechanisms of hierarchical reinforcement learning in cortico-striatal circuits 2: evidence from fMRI. Cereb Cortex 2011; 22:527-36. [PMID: 21693491 DOI: 10.1093/cercor/bhr117] [Citation(s) in RCA: 144] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023] Open
Abstract
The frontal lobes may be organized hierarchically such that more rostral frontal regions modulate cognitive control operations in caudal regions. In our companion paper (Frank MJ, Badre D. 2011. Mechanisms of hierarchical reinforcement learning in corticostriatal circuits I: computational analysis. 22:509-526), we provide novel neural circuit and algorithmic models of hierarchical cognitive control in cortico-striatal circuits. Here, we test key model predictions using functional magnetic resonance imaging (fMRI). Our neural circuit model proposes that contextual representations in rostral frontal cortex influence the striatal gating of contextual representations in caudal frontal cortex. Reinforcement learning operates at each level, such that the system adaptively learns to gate higher order contextual information into rostral regions. Our algorithmic Bayesian "mixture of experts" model captures the key computations of this neural model and provides trial-by-trial estimates of the learner's latent hypothesis states. In the present paper, we used these quantitative estimates to reanalyze fMRI data from a hierarchical reinforcement learning task reported in Badre D, Kayser AS, D'Esposito M. 2010. Frontal cortex and the discovery of abstract action rules. Neuron. 66:315--326. Results validate key predictions of the models and provide evidence for an individual cortico-striatal circuit for reinforcement learning of hierarchical structure at a specific level of policy abstraction. These findings are initially consistent with the proposal that hierarchical control in frontal cortex may emerge from interactions among nested cortico-striatal circuits at different levels of abstraction.
Collapse
Affiliation(s)
- David Badre
- Department of Cognitive, Linguistic, and Psychological Sciences, Brown Institute for Brain Sciences, Brown University, Providence, RI 02912-1978, USA.
| | | |
Collapse
|
116
|
Parush N, Tishby N, Bergman H. Dopaminergic Balance between Reward Maximization and Policy Complexity. Front Syst Neurosci 2011; 5:22. [PMID: 21603228 PMCID: PMC3093748 DOI: 10.3389/fnsys.2011.00022] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2010] [Accepted: 04/20/2011] [Indexed: 11/17/2022] Open
Abstract
Previous reinforcement-learning models of the basal ganglia network have highlighted the role of dopamine in encoding the mismatch between prediction and reality. Far less attention has been paid to the computational goals and algorithms of the main-axis (actor). Here, we construct a top-down model of the basal ganglia with emphasis on the role of dopamine as both a reinforcement learning signal and as a pseudo-temperature signal controlling the general level of basal ganglia excitability and motor vigilance of the acting agent. We argue that the basal ganglia endow the thalamic-cortical networks with the optimal dynamic tradeoff between two constraints: minimizing the policy complexity (cost) and maximizing the expected future reward (gain). We show that this multi-dimensional optimization processes results in an experience-modulated version of the softmax behavioral policy. Thus, as in classical softmax behavioral policies, probability of actions are selected according to their estimated values and the pseudo-temperature, but in addition also vary according to the frequency of previous choices of these actions. We conclude that the computational goal of the basal ganglia is not to maximize cumulative (positive and negative) reward. Rather, the basal ganglia aim at optimization of independent gain and cost functions. Unlike previously suggested single-variable maximization processes, this multi-dimensional optimization process leads naturally to a softmax-like behavioral policy. We suggest that beyond its role in the modulation of the efficacy of the cortico-striatal synapses, dopamine directly affects striatal excitability and thus provides a pseudo-temperature signal that modulates the tradeoff between gain and cost. The resulting experience and dopamine modulated softmax policy can then serve as a theoretical framework to account for the broad range of behaviors and clinical states governed by the basal ganglia and dopamine systems.
Collapse
Affiliation(s)
- Naama Parush
- The Interdisciplinary Center for Neural Computation, The Hebrew University Jerusalem, Israel
| | | | | |
Collapse
|
117
|
Waltz JA, Frank MJ, Wiecki TV, Gold JM. Altered probabilistic learning and response biases in schizophrenia: behavioral evidence and neurocomputational modeling. Neuropsychology 2011; 25:86-97. [PMID: 21090899 DOI: 10.1037/a0020882] [Citation(s) in RCA: 100] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022] Open
Abstract
OBJECTIVE Patients with schizophrenia (SZ) show reinforcement learning impairments related to both the gradual/procedural acquisition of reward contingencies, and the ability to use trial-to-trial feedback to make rapid behavioral adjustments. METHOD We used neurocomputational modeling to develop plausible mechanistic hypotheses explaining reinforcement learning impairments in individuals with SZ. We tested the model with a novel Go/NoGo learning task in which subjects had to learn to respond or withhold responses when presented with different stimuli associated with different probabilities of gains or losses in points. We analyzed data from 34 patients and 23 matched controls, characterizing positive- and negative-feedback-driven learning in both a training phase and a test phase. RESULTS Consistent with simulations from a computational model of aberrant dopamine input to the basal ganglia patients, patients with SZ showed an overall increased rate of responding in the training phase, together with reduced response-time acceleration to frequently rewarded stimuli across training blocks, and a reduced relative preference for frequently rewarded training stimuli in the test phase. Patients did not differ from controls on measures of procedural negative-feedback-driven learning, although patients with SZ exhibited deficits in trial-to-trial adjustments to negative feedback, with these measures correlating with negative symptom severity. CONCLUSIONS These findings support the hypothesis that patients with SZ have a deficit in procedural "Go" learning, linked to abnormalities in DA transmission at D1-type receptors, despite a "Go bias" (increased response rate), potentially related to excessive tonic dopamine. Deficits in trial-to-trial reinforcement learning were limited to a subset of patients with SZ with severe negative symptoms, putatively stemming from prefrontal cortical dysfunction.
Collapse
Affiliation(s)
- James A Waltz
- Department of Psychiatry, University of Maryland School of Medicine, MD Psychiatric Research Center, P.O. Box 21247, Baltimore, MD 21228, USA.
| | | | | | | |
Collapse
|
118
|
Frank MJ. Computational models of motivated action selection in corticostriatal circuits. Curr Opin Neurobiol 2011; 21:381-6. [PMID: 21498067 DOI: 10.1016/j.conb.2011.02.013] [Citation(s) in RCA: 125] [Impact Index Per Article: 8.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2011] [Accepted: 02/25/2011] [Indexed: 11/19/2022]
Abstract
Computational models of the basal ganglia have matured and received increasing attention over the last decade. This article reviews some of the theoretical advances offered by these models, focusing on motor and cognitive action selection, learning, and the interaction between multiple corticostriatal circuits in selection and learning.
Collapse
Affiliation(s)
- Michael J Frank
- Department of Cognitive, Linguistic, and Psychological Sciences, Brown University, Providence, RI, USA.
| |
Collapse
|
119
|
Koziol LF, Budding DE, Chidekel D. Adaptation, expertise, and giftedness: towards an understanding of cortical, subcortical, and cerebellar network contributions. THE CEREBELLUM 2011; 9:499-529. [PMID: 20680539 DOI: 10.1007/s12311-010-0192-7] [Citation(s) in RCA: 49] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Current cortico-centric models of cognition lack a cohesive neuroanatomic framework that sufficiently considers overlapping levels of function, from "pathological" through "normal" to "gifted" or exceptional ability. While most cognitive theories presume an evolutionary context, few actively consider the process of adaptation, including concepts of neurodevelopment. Further, the frequent co-occurrence of "gifted" and "pathological" function is difficult to explain from a cortico-centric point of view. This comprehensive review paper proposes a framework that includes the brain's vertical organization and considers "giftedness" from an evolutionary and neurodevelopmental vantage point. We begin by discussing the current cortico-centric model of cognition and its relationship to intelligence. We then review an integrated, dual-tiered model of cognition that better explains the process of adaptation by simultaneously allowing for both stimulus-based processing and higher-order cognitive control. We consider the role of the basal ganglia within this model, particularly in relation to reward circuitry and instrumental learning. We review the important role of white matter tracts in relation to speed of adaptation and development of behavioral mastery. We examine the cerebellum's critical role in behavioral refinement and in cognitive and behavioral automation, particularly in relation to expertise and giftedness. We conclude this integrated model of brain function by considering the savant syndrome, which we believe is best understood within the context of a dual-tiered model of cognition that allows for automaticity in adaptation as well as higher-order executive control.
Collapse
|
120
|
Au WL, Zhou J, Palmes P, Sitoh YY, Tan LC, Rajapakse JC. Levodopa and the feedback process on set-shifting in Parkinson's disease. Hum Brain Mapp 2011; 33:27-39. [PMID: 21438075 DOI: 10.1002/hbm.21187] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2010] [Revised: 09/04/2010] [Accepted: 09/20/2010] [Indexed: 11/10/2022] Open
Abstract
OBJECTIVE To study the interaction between levodopa and the feedback process on set-shifting in Parkinson's disease (PD). METHODS Functional magnetic resonance imaging (fMRI) studies were performed on 13 PD subjects and 17 age-matched healthy controls while they performed a modified card-sorting task. Experimental time periods were defined based on the types of feedback provided. PD subjects underwent the fMRI experiment twice, once during "off" medication (PDoff) and again after levodopa replacement (PDon). RESULTS Compared with normal subjects, the cognitive processing times were prolonged in PDoff but not in PDon subjects during learning through positive outcomes. The ability to set-shift through negative outcomes was not affected in PD subjects, even when "off" medication. Intergroup comparisons showed the lateral prefrontal cortex was deactivated in PDoff subjects during positive feedback learning, especially following internal feedback cues. The cortical activations were increased in the posterior brain regions in PDoff subjects following external feedback learning, especially when negative feedback cues were provided. Levodopa replacement did not completely restore the activation patterns in PD subjects to normal although activations in the corticostriatal loops were restored. CONCLUSION PD subjects showed differential ability to set-shift, depending on the dopamine status as well as the types of feedback cues provided. PD subjects had difficulty performing set-shift tasks through positive outcomes when "off" medication, and showed improvement after levodopa replacement. The ability to set-shift through negative feedback was not affected in PD subjects even when "off" medication, possibly due to compensatory changes outside the nigrostriatal dopaminergic pathway.
Collapse
Affiliation(s)
- Wing Lok Au
- Department of Neurology, Parkinson's Disease and Movement Disorders Centre, National Neuroscience Institute, Singapore.
| | | | | | | | | | | |
Collapse
|
121
|
Kamali Sarvestani I, Lindahl M, Hellgren-Kotaleski J, Ekeberg O. The arbitration-extension hypothesis: a hierarchical interpretation of the functional organization of the Basal Ganglia. Front Syst Neurosci 2011; 5:13. [PMID: 21441994 PMCID: PMC3061412 DOI: 10.3389/fnsys.2011.00013] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2010] [Accepted: 02/24/2011] [Indexed: 11/30/2022] Open
Abstract
Based on known anatomy and physiology, we present a hypothesis where the basal ganglia motor loop is hierarchically organized in two main subsystems: the arbitration system and the extension system. The arbitration system, comprised of the subthalamic nucleus, globus pallidus, and pedunculopontine nucleus, serves the role of selecting one out of several candidate actions as they are ascending from various brain stem motor regions and aggregated in the centromedian thalamus or descending from the extension system or from the cerebral cortex. This system is an action-input/action-output system whose winner-take-all mechanism finds the strongest response among several candidates to execute. This decision is communicated back to the brain stem by facilitating the desired action via cholinergic/glutamatergic projections and suppressing conflicting alternatives via GABAergic connections. The extension system, comprised of the striatum and, again, globus pallidus, can extend the repertoire of responses by learning to associate novel complex states to certain actions. This system is a state-input/action-output system, whose organization enables it to encode arbitrarily complex Boolean logic rules using striatal neurons that only fire given specific constellations of inputs (Boolean AND) and pallidal neurons that are silenced by any striatal input (Boolean OR). We demonstrate the capabilities of this hierarchical system by a computational model where a simulated generic “animal” interacts with an environment by selecting direction of movement based on combinations of sensory stimuli, some being appetitive, others aversive or neutral. While the arbitration system can autonomously handle conflicting actions proposed by brain stem motor nuclei, the extension system is required to execute learned actions not suggested by external motor centers. Being precise in the functional role of each component of the system, this hypothesis generates several readily testable predictions.
Collapse
Affiliation(s)
- Iman Kamali Sarvestani
- Department of Computational Biology, School of Computer Science and Communication, Royal Institute of Technology Stockholm, Sweden
| | | | | | | |
Collapse
|
122
|
Maia TV, Frank MJ. From reinforcement learning models to psychiatric and neurological disorders. Nat Neurosci 2011; 14:154-62. [PMID: 21270784 DOI: 10.1038/nn.2723] [Citation(s) in RCA: 465] [Impact Index Per Article: 33.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Over the last decade and a half, reinforcement learning models have fostered an increasingly sophisticated understanding of the functions of dopamine and cortico-basal ganglia-thalamo-cortical (CBGTC) circuits. More recently, these models, and the insights that they afford, have started to be used to understand important aspects of several psychiatric and neurological disorders that involve disturbances of the dopaminergic system and CBGTC circuits. We review this approach and its existing and potential applications to Parkinson's disease, Tourette's syndrome, attention-deficit/hyperactivity disorder, addiction, schizophrenia and preclinical animal models used to screen new antipsychotic drugs. The approach's proven explanatory and predictive power bodes well for the continued growth of computational psychiatry and computational neurology.
Collapse
Affiliation(s)
- Tiago V Maia
- Department of Psychiatry, Columbia University, New York, New York, USA.
| | | |
Collapse
|
123
|
Deficits in positive reinforcement learning and uncertainty-driven exploration are associated with distinct aspects of negative symptoms in schizophrenia. Biol Psychiatry 2011; 69:424-31. [PMID: 21168124 PMCID: PMC3039035 DOI: 10.1016/j.biopsych.2010.10.015] [Citation(s) in RCA: 170] [Impact Index Per Article: 12.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/23/2010] [Revised: 09/24/2010] [Accepted: 10/12/2010] [Indexed: 11/23/2022]
Abstract
BACKGROUND Negative symptoms are core features of schizophrenia (SZ); however, the cognitive and neural basis for individual negative symptom domains remains unclear. Converging evidence suggests a role for striatal and prefrontal dopamine in reward learning and the exploration of actions that might produce outcomes that are better than the status quo. The current study examines whether deficits in reinforcement learning and uncertainty-driven exploration predict specific negative symptom domains. METHODS We administered a temporal decision-making task, which required trial-by-trial adjustment of reaction time to maximize reward receipt, to 51 patients with SZ and 39 age-matched healthy control subjects. Task conditions were designed such that expected value (probability × magnitude) increased, decreased, or remained constant with increasing response times. Computational analyses were applied to estimate the degree to which trial-by-trial responses are influenced by reinforcement history. RESULTS Individuals with SZ showed impaired Go learning but intact NoGo learning relative to control subjects. These effects were most pronounced in patients with higher levels of negative symptoms. Uncertainty-based exploration was substantially reduced in individuals with SZ and selectively correlated with clinical ratings of anhedonia. CONCLUSIONS Schizophrenia patients, particularly those with high negative symptoms, failed to speed reaction times to increase positive outcomes and showed reduced tendency to explore when alternative actions could lead to better outcomes than the status quo. Results are interpreted in the context of current computational, genetic, and pharmacological data supporting the roles of striatal and prefrontal dopamine in these processes.
Collapse
|
124
|
Boureau YL, Dayan P. Opponency revisited: competition and cooperation between dopamine and serotonin. Neuropsychopharmacology 2011; 36:74-97. [PMID: 20881948 PMCID: PMC3055522 DOI: 10.1038/npp.2010.151] [Citation(s) in RCA: 301] [Impact Index Per Article: 21.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/02/2010] [Revised: 08/03/2010] [Accepted: 08/03/2010] [Indexed: 11/08/2022]
Abstract
Affective valence lies on a spectrum ranging from punishment to reward. The coding of such spectra in the brain almost always involves opponency between pairs of systems or structures. There is ample evidence for the role of dopamine in the appetitive half of this spectrum, but little agreement about the existence, nature, or role of putative aversive opponents such as serotonin. In this review, we consider the structure of opponency in terms of previous biases about the nature of the decision problems that animals face, the conflicts that may thus arise between Pavlovian and instrumental responses, and an additional spectrum joining invigoration to inhibition. We use this analysis to shed light on aspects of the role of serotonin and its interactions with dopamine.
Collapse
Affiliation(s)
- Y-Lan Boureau
- The Courant Institute of Mathematical Sciences, New York University, New York, NY, USA
| | - Peter Dayan
- Gatsby Computational Neuroscience Unit, London, UK
| |
Collapse
|
125
|
Frank MJ, Fossella JA. Neurogenetics and pharmacology of learning, motivation, and cognition. Neuropsychopharmacology 2011; 36:133-52. [PMID: 20631684 PMCID: PMC3055524 DOI: 10.1038/npp.2010.96] [Citation(s) in RCA: 146] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/01/2010] [Revised: 06/09/2010] [Accepted: 06/10/2010] [Indexed: 02/07/2023]
Abstract
Many of the individual differences in cognition, motivation, and learning-and the disruption of these processes in neurological conditions-are influenced by genetic factors. We provide an integrative synthesis across human and animal studies, focusing on a recent spate of evidence implicating a role for genes controlling dopaminergic function in frontostriatal circuitry, including COMT, DARPP-32, DAT1, DRD2, and DRD4. These genetic effects are interpreted within theoretical frameworks developed in the context of the broader cognitive and computational neuroscience literature, constrained by data from pharmacological, neuroimaging, electrophysiological, and patient studies. In this framework, genes modulate the efficacy of particular neural computations, and effects of genetic variation are revealed by assays designed to be maximally sensitive to these computations. We discuss the merits and caveats of this approach and outline a number of novel candidate genes of interest for future study.
Collapse
Affiliation(s)
- Michael J Frank
- Department of Cognitive, Linguistic and Psychological Sciences, Brown Institute for Brain Science, Brown University, Providence, RI 02912-1978, USA.
| | | |
Collapse
|
126
|
EEG oscillatory patterns are associated with error prediction during music performance and are altered in musician's dystonia. Neuroimage 2010; 55:1791-803. [PMID: 21195188 DOI: 10.1016/j.neuroimage.2010.12.050] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2010] [Revised: 11/19/2010] [Accepted: 12/20/2010] [Indexed: 11/23/2022] Open
Abstract
Skilled performance requires the ability to monitor ongoing behavior, detect errors in advance and modify the performance accordingly. The acquisition of fast predictive mechanisms might be possible due to the extensive training characterizing expertise performance. Recent EEG studies on piano performance reported a negative event-related potential (ERP) triggered in the ACC 70 ms before performance errors (pitch errors due to incorrect keypress). This ERP component, termed pre-error related negativity (pre-ERN), was assumed to reflect processes of error detection in advance. However, some questions remained to be addressed: (i) Does the electrophysiological marker prior to errors reflect an error signal itself or is it related instead to the implementation of control mechanisms? (ii) Does the posterior frontomedial cortex (pFMC, including ACC) interact with other brain regions to implement control adjustments following motor prediction of an upcoming error? (iii) Can we gain insight into the electrophysiological correlates of error prediction and control by assessing the local neuronal synchronization and phase interaction among neuronal populations? (iv) Finally, are error detection and control mechanisms defective in pianists with musician's dystonia (MD), a focal task-specific dystonia resulting from dysfunction of the basal ganglia-thalamic-frontal circuits? Consequently, we investigated the EEG oscillatory and phase synchronization correlates of error detection and control during piano performances in healthy pianists and in a group of pianists with MD. In healthy pianists, the main outcomes were increased pre-error theta and beta band oscillations over the pFMC and 13-15 Hz phase synchronization, between the pFMC and the right lateral prefrontal cortex, which predicted corrective mechanisms. In MD patients, the pattern of phase synchronization appeared in a different frequency band (6-8 Hz) and correlated with the severity of the disorder. The present findings shed new light on the neural mechanisms, which might implement motor prediction by means of forward control processes, as they function in healthy pianists and in their altered form in patients with MD.
Collapse
|
127
|
Pan WX, Mao T, Dudman JT. Inputs to the dorsal striatum of the mouse reflect the parallel circuit architecture of the forebrain. Front Neuroanat 2010; 4:147. [PMID: 21212837 PMCID: PMC3014656 DOI: 10.3389/fnana.2010.00147] [Citation(s) in RCA: 89] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2010] [Accepted: 11/23/2010] [Indexed: 11/13/2022] Open
Abstract
The basal ganglia play a critical role in the regulation of voluntary action in vertebrates. Our understanding of the function of the basal ganglia relies heavily upon anatomical information, but continued progress will require an understanding of the specific functional roles played by diverse cell types and their connectivity. An increasing number of mouse lines allow extensive identification, characterization, and manipulation of specified cell types in the basal ganglia. Despite the promise of genetically modified mice for elucidating the functional roles of diverse cell types, there is relatively little anatomical data obtained directly in the mouse. Here we have characterized the retrograde labeling obtained from a series of tracer injections throughout the dorsal striatum of adult mice. We found systematic variations in input along both the medial–lateral and anterior–posterior neuraxes in close agreement with canonical features of basal ganglia anatomy in the rat. In addition to the canonical features we have provided experimental support for the importance of non-canonical inputs to the striatum from the raphe nuclei and the amygdala. To look for organization at a finer scale we have analyzed the correlation structure of labeling intensity across our entire dataset. Using this analysis we found substantial local heterogeneity within the large-scale order. From this analysis we conclude that individual striatal sites receive varied combinations of cortical and thalamic input from multiple functional areas, consistent with some earlier studies in the rat that have suggested the presence of a combinatorial map.
Collapse
Affiliation(s)
- Weixing X Pan
- Janelia Farm Research Campus, Howard Hughes Medical Institute , Ashburn, VA, USA
| | | | | |
Collapse
|
128
|
Shiflett MW, Balleine BW. Contributions of ERK signaling in the striatum to instrumental learning and performance. Behav Brain Res 2010; 218:240-7. [PMID: 21147168 DOI: 10.1016/j.bbr.2010.12.010] [Citation(s) in RCA: 73] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2010] [Accepted: 12/07/2010] [Indexed: 02/07/2023]
Abstract
The striatum is critical for learning and decision making; however, the molecular mechanisms that govern striatum function are not fully understood. The extracellular signal regulated kinase (ERK) cascade is an important signaling pathway that underlies synaptic plasticity, cellular excitability, learning and arousal. This review focuses on the role of ERK signaling in striatum function. ERK is activated in the striatum by coordinated dopamine and glutamate receptor signaling, where it underlies corticostriatal synaptic plasticity and influences striatal cell excitability. ERK activation in the dorsal striatum is necessary for action-outcome learning and performance of goal-directed actions. In the ventral striatum, ERK is necessary for the motivating effects of reward-associated stimuli on instrumental performance. Dysregulation of ERK signaling in the striatum by repeated drug exposure contributes to the development of addictive behavior. These results highlight the importance of ERK signaling in the striatum as a critical substrate for learning and as a regulator of ongoing behavior. Furthermore, they suggest that ERK may be a suitable target for therapeutics to treat disorders of learning and decision making that arise from compromised striatum function.
Collapse
Affiliation(s)
- Michael W Shiflett
- Department of Psychology, Rutgers University, 301 Smith Hall, 101 Warren St., Newark, NJ 07102, USA.
| | | |
Collapse
|
129
|
Focus on the positive: computational simulations implicate asymmetrical reward prediction error signals in childhood attention-deficit/hyperactivity disorder. Brain Res 2010; 1365:18-34. [PMID: 20875804 DOI: 10.1016/j.brainres.2010.09.065] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2010] [Revised: 08/30/2010] [Accepted: 09/17/2010] [Indexed: 02/02/2023]
Abstract
A number of hypotheses have suggested that the principal neurological dysfunction responsible for the behavioural symptoms associated with Attention-Deficit/Hyperactive Disorder (ADHD) is likely rooted in abnormal phasic signals coded by the firing rate of midbrain dopamine neurons. We present a formal investigation of the impact atypical phasic dopamine signals have on behaviour by applying a TD(λ) reinforcement learning model to simulations of operant conditioning tasks that have been argued to quantify the hyperactive, inattentive and impulsive behaviour associated with ADHD. The results presented here suggest that asymmetrically effective dopamine signals encoded by a punctate increase or decrease in dopamine levels provide the best account for the behaviour of children with ADHD as well as an animal model of ADHD, the spontaneously hypertensive rat (SHR). The biological sources of this asymmetry are considered, as are other computational models of ADHD.
Collapse
|
130
|
Robinson OJ, Frank MJ, Sahakian BJ, Cools R. Dissociable responses to punishment in distinct striatal regions during reversal learning. Neuroimage 2010; 51:1459-67. [PMID: 20303408 PMCID: PMC3038262 DOI: 10.1016/j.neuroimage.2010.03.036] [Citation(s) in RCA: 56] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2010] [Revised: 03/04/2010] [Accepted: 03/11/2010] [Indexed: 12/30/2022] Open
Abstract
Adaptive behavior depends on the ability to flexibly alter our choices in response to changes in reward and punishment contingencies. One brain region frequently implicated in such behavior is the striatum. However, this region is functionally diverse and there are a number of apparent inconsistencies across previous studies. For instance, how can significant BOLD responses in the ventral striatum during punishment-based reversal learning be reconciled with the frequently demonstrated role of the ventral striatum in reward processing? Here we attempt to address this question by separately examining BOLD responses during reversal learning driven by reward and during reversal learning driven by punishment. We demonstrate simultaneous valence-specific and valence-nonspecific signals in the striatum, with the posterior dorsal striatum responding only to unexpected reward, and the anterior ventral striatum responding to both unexpected punishment as well as unexpected reward. These data help to reconcile conflicting findings from previous studies by showing that distinct regions of the striatum exhibit dissociable responses to punishment during reversal learning.
Collapse
Affiliation(s)
- Oliver J Robinson
- Department of Psychiatry and Behavioural and Clinical Neuroscience Institute, University of Cambridge, Cambridge, Addenbrooke's Hospital, P. O. Box 189, Level E4, Hills Road, Cambridge, CB2 2QQ, UK.
| | | | | | | |
Collapse
|
131
|
Mittal VA, Walker EF, Bearden CE, Walder D, Trottman H, Daley M, Simone A, Cannon TD. Markers of basal ganglia dysfunction and conversion to psychosis: neurocognitive deficits and dyskinesias in the prodromal period. Biol Psychiatry 2010; 68:93-9. [PMID: 20227679 PMCID: PMC2891189 DOI: 10.1016/j.biopsych.2010.01.021] [Citation(s) in RCA: 68] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/31/2009] [Revised: 01/17/2010] [Accepted: 01/20/2010] [Indexed: 11/22/2022]
Abstract
BACKGROUND Movement abnormalities and cognitive deficits may represent external markers of an underlying neural process linked with the early etiology of psychosis. As basal ganglia function plays a governing role in both movement and cognitive processes, an understanding of the relationship between these phenomena stands to inform etiologic conceptualizations of vulnerability and psychotic disorders. METHODS In this investigation, trained raters coded movement abnormalities in videotapes from structured interviews of adolescents and young adults with a prodromal risk syndrome (n = 90). The participants were administered a neuropsychological battery including measures of verbal comprehension, perceptual organization, immediate/delayed auditory memory, and an estimate of full-scale intelligence quotient. Diagnostic status was followed for a 2-year period utilizing structured clinical interviews, during which time 24 high-risk participants (26.66%) converted to an Axis I psychotic disorder. RESULTS Elevated dyskinetic movements in the upper-body region were correlated with deficits in domains of verbal comprehension, perceptual organization, and both immediate and delayed auditory memory. Further, discriminant function analyses indicated that baseline movement abnormalities and neurocognitive deficits significantly classified those high-risk participants who would eventually convert to a psychotic disorder (72.3%). CONCLUSIONS Results support a common cortico-striato-pallido-thalamic circuit irregularity, underlying both movement abnormalities and cognitive deficits in individuals at high risk for psychosis. Models incorporating external markers of progressive basal ganglia dysfunction may enhance detection and preventive intervention for those high-risk individuals most in need of treatment.
Collapse
Affiliation(s)
- Vijay A Mittal
- Department of Psychology, University of California Los Angeles, Los Angeles, California, USA.
| | | | | | | | | | | | | | | |
Collapse
|
132
|
Goldstein MH, Waterfall HR, Lotem A, Halpern JY, Schwade JA, Onnis L, Edelman S. General cognitive principles for learning structure in time and space. Trends Cogn Sci 2010; 14:249-58. [PMID: 20395164 PMCID: PMC11655155 DOI: 10.1016/j.tics.2010.02.004] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2009] [Revised: 02/18/2010] [Accepted: 02/19/2010] [Indexed: 12/27/2022]
Abstract
How are hierarchically structured sequences of objects, events or actions learned from experience and represented in the brain? When several streams of regularities present themselves, which will be learned and which ignored? Can statistical regularities take effect on their own, or are additional factors such as behavioral outcomes expected to influence statistical learning? Answers to these questions are starting to emerge through a convergence of findings from naturalistic observations, behavioral experiments, neurobiological studies, and computational analyses and simulations. We propose that a small set of principles are at work in every situation that involves learning of structure from patterns of experience and outline a general framework that accounts for such learning.
Collapse
Affiliation(s)
- Michael H Goldstein
- 211 Uris Hall, Department of Psychology, Cornell University, Ithaca, NY 14853, USA.
| | | | | | | | | | | | | |
Collapse
|
133
|
Bortolanza M, Wietzikoski EC, Boschen SL, Dombrowski PA, Latimer M, Maclaren DAA, Winn P, Da Cunha C. Functional disconnection of the substantia nigra pars compacta from the pedunculopontine nucleus impairs learning of a conditioned avoidance task. Neurobiol Learn Mem 2010; 94:229-39. [PMID: 20595069 DOI: 10.1016/j.nlm.2010.05.011] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2010] [Revised: 05/17/2010] [Accepted: 05/26/2010] [Indexed: 01/12/2023]
Abstract
The pedunculopontine tegmental nucleus (PPTg) targets nuclei in the basal ganglia, including the substantia nigra pars compacta (SNc), in which neuronal loss occurs in Parkinson's disease, a condition in which patients show cognitive as well as motor disturbances. Partial loss and functional abnormalities of neurons in the PPTg are also associated with Parkinson's disease. We hypothesized that the interaction of PPTg and SNc might be important for cognitive impairments and so investigated whether disrupting the connections between the PPTg and SNc impaired learning of a conditioned avoidance response (CAR) by male Wistar rats. The following groups were tested: PPTg unilateral; SNc unilateral; PPTg-SNc ipsilateral (ipsilateral lesions in PPTg and SNc); PPTg-SNc contralateral (contralateral lesions in PPTg and SNc); sham lesions (of each type). SNc lesions were made with 1-methyl-4-phenyl-1,2,3,6-tetrahydropyridine HCl (MPTP, 0.6micromol); PPTg lesions with ibotenate (24nmol). After recovery, all rats underwent 50-trial sessions of 2-way active avoidance conditioning for 3 consecutive days. Rats with unilateral lesions in PPTg or SNc learnt this, however rats with contralateral (but not ipsilateral) combined lesions in both structures presented no sign of learning. This effect was not likely to be due to sensorimotor impairment because lesions did not affect reaction time to the tone or footshock during conditioning. However, an increased number of non-responses were observed in the rats with contralateral lesions. The results support the hypothesis that a functional interaction between PPTg and SNc is needed for CAR learning and performance.
Collapse
Affiliation(s)
- Mariza Bortolanza
- Laboratorio de Fisiologia e Farmacologia do Sistema Nervoso Central, Departamento de Farmacologia, Universidade Federal do Parana (UFPR), C.P. 19031, 81531-980 Curitiba, Brazil
| | | | | | | | | | | | | | | |
Collapse
|
134
|
Badre D, Kayser AS, D'Esposito M. Frontal cortex and the discovery of abstract action rules. Neuron 2010; 66:315-26. [PMID: 20435006 DOI: 10.1016/j.neuron.2010.03.025] [Citation(s) in RCA: 211] [Impact Index Per Article: 14.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/23/2010] [Indexed: 10/19/2022]
Abstract
Although we often encounter circumstances with which we have no prior experience, we rapidly learn how to behave in these novel situations. Such adaptive behavior relies on abstract behavioral rules that are generalizable, rather than concrete rules mapping specific cues to specific responses. Although the frontal cortex is known to support concrete rule learning, less well understood are the neural mechanisms supporting the acquisition of abstract rules. Here, we use a reinforcement learning paradigm to demonstrate that more anterior regions along the rostro-caudal axis of frontal cortex support rule learning at higher levels of abstraction. Moreover, these results indicate that when humans confront new rule learning problems, this rostro-caudal division of labor supports the search for relationships between context and action at multiple levels of abstraction simultaneously.
Collapse
Affiliation(s)
- David Badre
- Department of Cognitive, Linguistic, and Psychological Sciences, Brown University, Providence, RI 02906, USA.
| | | | | |
Collapse
|
135
|
Berretta N, Bernardi G, Mercuri NB. Firing properties and functional connectivity of substantia nigra pars compacta neurones recorded with a multi-electrode array in vitro. J Physiol 2010; 588:1719-35. [PMID: 20351050 DOI: 10.1113/jphysiol.2010.189415] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022] Open
Abstract
Dopamine (DA) neurones of the substantia nigra pars compacta (SNc) are involved in a wide variety of functions, including motor control and reward-based learning. In order to gain new insights into the firing properties of neuronal ensembles in the SNc, we recorded extracellular single units from spontaneously active neurones, using a multi-electrode array (MEA) device in midbrain slices. The majority of neurones (50.21%) had a low firing frequency (1-3 Hz) and a stable pacemaker-like pattern, while others (44.84%) were irregular, but still firing at a low rate. The remaining population (4.95%) comprised neurones with a regular higher firing rate (5-10 Hz). High rate neurones, on the whole, were insensitive to DA (30 mum), while low rate neurones were mostly inhibited by DA, although responding either with a prominent or a weak inhibition. However, we recorded low rate regular neurones that were insensitive to DA, or irregular low rate neurones excited by DA. Interestingly, we found pairs of active neurones (12.10 +/- 3.14%) with a significant proportion of spikes occurring synchronously. Moreover, the crosscorrelation probability in each pair tended to increase in response to DA. In conclusion, MEA recordings in midbrain slices reveal a much more complex picture than previously reported with regard to the firing pattern and DA sensitivity of spontaneously active SNc neurones. Moreover, the study opens new prospectives for the in vitro investigation of functional connectivity in the midbrain dopaminergic system, thus proposing new targets for the pharmacological treatment of DA-dependent neurological disorders.
Collapse
Affiliation(s)
- Nicola Berretta
- Fondazione Santa Lucia IRCCS - Via del Fosso di Fiorano 64, 00143 Rome, Italy.
| | | | | |
Collapse
|
136
|
Samson RD, Frank MJ, Fellous JM. Computational models of reinforcement learning: the role of dopamine as a reward signal. Cogn Neurodyn 2010; 4:91-105. [PMID: 21629583 DOI: 10.1007/s11571-010-9109-x] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2009] [Revised: 02/17/2010] [Accepted: 02/24/2010] [Indexed: 11/27/2022] Open
Abstract
Reinforcement learning is ubiquitous. Unlike other forms of learning, it involves the processing of fast yet content-poor feedback information to correct assumptions about the nature of a task or of a set of stimuli. This feedback information is often delivered as generic rewards or punishments, and has little to do with the stimulus features to be learned. How can such low-content feedback lead to such an efficient learning paradigm? Through a review of existing neuro-computational models of reinforcement learning, we suggest that the efficiency of this type of learning resides in the dynamic and synergistic cooperation of brain systems that use different levels of computations. The implementation of reward signals at the synaptic, cellular, network and system levels give the organism the necessary robustness, adaptability and processing speed required for evolutionary and behavioral success.
Collapse
|
137
|
Wiecki TV, Frank MJ. Neurocomputational models of motor and cognitive deficits in Parkinson's disease. PROGRESS IN BRAIN RESEARCH 2010; 183:275-97. [PMID: 20696325 DOI: 10.1016/s0079-6123(10)83014-6] [Citation(s) in RCA: 74] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
We review the contributions of biologically constrained computational models to our understanding of motor and cognitive deficits in Parkinson's disease (PD). The loss of dopaminergic neurons innervating the striatum in PD, and the well-established role of dopamine (DA) in reinforcement learning (RL), enable neural network models of the basal ganglia (BG) to derive concrete and testable predictions. We focus in this review on one simple underlying principle - the notion that reduced DA increases activity and causes long-term potentiation in the indirect pathway of the BG. We show how this theory can provide a unified account of diverse and seemingly unrelated phenomena in PD including progressive motor degeneration as well as cognitive deficits in RL, decision making and working memory. DA replacement therapy and deep brain stimulation can alleviate some aspects of these impairments, but can actually introduce negative effects such as motor dyskinesias and cognitive impulsivity. We discuss these treatment effects in terms of modulation of specific mechanisms within the computational framework. In addition, we review neurocomputational interpretations of increased impulsivity in the face of response conflict in patients with deep-brain-stimulation.
Collapse
|
138
|
Surmeier DJ, Shen W, Day M, Gertler T, Chan S, Tian X, Plotkin JL. The role of dopamine in modulating the structure and function of striatal circuits. PROGRESS IN BRAIN RESEARCH 2010; 183:149-67. [PMID: 20696319 DOI: 10.1016/s0079-6123(10)83008-0] [Citation(s) in RCA: 55] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
Dopamine (DA) is a key regulator of action selection and associative learning. The striatum has long been thought to be a major locus of DA action in this process. Although all striatal cell types express G protein-coupled receptors for DA, the effects of DA on principal medium spiny neurons (MSNs) understandably have received the most attention. In the two principal classes of MSN, DA receptor expression diverges, with striatonigral MSNs robustly expressing D(1) receptors and striatopallidal MSNs expressing D(2) receptors. In the last couple of years, our understanding of how these receptors and the intracellular signalling cascades that they couple to modulate dendritic physiology and synaptic plasticity has rapidly expanded, fuelled in large measure by the development of new optical and genetic tools. These tools also have enabled a rapid expansion of our understanding of the striatal adaptations in models of Parkinson's disease. This chapter highlights some of the major advances in these areas.
Collapse
Affiliation(s)
- D James Surmeier
- Department of Physiology, Feinberg School of Medicine, Northwestern University, Chicago, IL, USA.
| | | | | | | | | | | | | |
Collapse
|
139
|
Robinson OJ, Standing HR, DeVito EE, Cools R, Sahakian BJ. Dopamine precursor depletion improves punishment prediction during reversal learning in healthy females but not males. Psychopharmacology (Berl) 2010; 211:187-95. [PMID: 20495788 PMCID: PMC2892070 DOI: 10.1007/s00213-010-1880-1] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/21/2010] [Accepted: 05/03/2010] [Indexed: 11/17/2022]
Abstract
INTRODUCTION The neurotransmitter dopamine has frequently been implicated in reward processing but is also, increasingly, implicated in punishment processing. We have previously shown that both patients with Parkinson's disease and healthy individuals with low dopamine (DA) synthesis are better at reversal learning based on punishment than reward. Here, we extend these prior findings by examining the effects of artificially reducing DA synthesis in healthy individuals performing this previously employed task. METHODS In a double-blind, placebo-controlled crossover design, we applied the acute tyrosine and phenylalanine depletion (ATPD) procedure to reduce global DA synthesis in 15 female and 14 male subjects. Each subject performed the reward- and punishment-based reversal-learning paradigm. RESULTS There was a significant three-way interaction between ATPD, the valence of the outcome signalling reversal and the gender of the participants. Examination of punishment and reward-based reversals separately revealed that this was driven by a significant improvement in punishment processing in female but not male subjects following DA depletion. CONCLUSIONS Reducing DA synthesis in healthy individuals shifted sensitivity of performance from reward to punishment processing. Gender differences in DA synthesis might underlie the selectivity of this effect to female subjects. Such gender biases may go some way towards explaining the gender biases in certain psychiatric disorders such as depression and Parkinson's disease.
Collapse
Affiliation(s)
- Oliver J. Robinson
- Department of Psychiatry and Behavioural and Clinical Neuroscience Institute, University of Cambridge, Cambridge, Addenbrooke’s Hospital, P.O. Box 189, Level E4, Hills Road, Cambridge, CB2 2QQ UK ,Section on Neuroimaging in Mood and Anxiety Disorders, National Institute of Mental Health, National Institutes of Health, Bethesda, MD USA
| | - Holly R. Standing
- Department of Psychiatry and Behavioural and Clinical Neuroscience Institute, University of Cambridge, Cambridge, Addenbrooke’s Hospital, P.O. Box 189, Level E4, Hills Road, Cambridge, CB2 2QQ UK
| | - Elise E. DeVito
- Department of Psychiatry and Behavioural and Clinical Neuroscience Institute, University of Cambridge, Cambridge, Addenbrooke’s Hospital, P.O. Box 189, Level E4, Hills Road, Cambridge, CB2 2QQ UK ,Department of Psychiatry, Yale University School of Medicine, New Haven, CT USA
| | - Roshan Cools
- Donders Institute for Brain, Cognition and Behaviour, Radboud University Nijmegen Medical Centre, Nijmegen, The Netherlands
| | - Barbara J. Sahakian
- Department of Psychiatry and Behavioural and Clinical Neuroscience Institute, University of Cambridge, Cambridge, Addenbrooke’s Hospital, P.O. Box 189, Level E4, Hills Road, Cambridge, CB2 2QQ UK
| |
Collapse
|
140
|
Dopaminergic drugs modulate learning rates and perseveration in Parkinson's patients in a dynamic foraging task. J Neurosci 2009; 29:15104-14. [PMID: 19955362 DOI: 10.1523/jneurosci.3524-09.2009] [Citation(s) in RCA: 170] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
Making appropriate choices often requires the ability to learn the value of available options from experience. Parkinson's disease is characterized by a loss of dopamine neurons in the substantia nigra, neurons hypothesized to play a role in reinforcement learning. Although previous studies have shown that Parkinson's patients are impaired in tasks involving learning from feedback, they have not directly tested the widely held hypothesis that dopamine neuron activity specifically encodes the reward prediction error signal used in reinforcement learning models. To test a key prediction of this hypothesis, we fit choice behavior from a dynamic foraging task with reinforcement learning models and show that treatment with dopaminergic drugs alters choice behavior in a manner consistent with the theory. More specifically, we found that dopaminergic drugs selectively modulate learning from positive outcomes. We observed no effect of dopaminergic drugs on learning from negative outcomes. We also found a novel dopamine-dependent effect on decision making that is not accounted for by reinforcement learning models: perseveration in choice, independent of reward history, increases with Parkinson's disease and decreases with dopamine therapy.
Collapse
|
141
|
Surmeier DJ, Plotkin J, Shen W. Dopamine and synaptic plasticity in dorsal striatal circuits controlling action selection. Curr Opin Neurobiol 2009; 19:621-8. [PMID: 19896832 DOI: 10.1016/j.conb.2009.10.003] [Citation(s) in RCA: 143] [Impact Index Per Article: 8.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2009] [Accepted: 10/07/2009] [Indexed: 01/29/2023]
Abstract
The striatum is thought to play a central role in learning how to choose acts that lead to reward and avoid punishment. Dopamine-dependent modification of striatal synapses in the action selection circuitry has long been thought to be a key step toward this type of learning. The development of new genetic and optical tools has pushed this field forward in the last couple of years, demanding a re-evaluation of models of how experience controls dopamine-dependent synaptic plasticity and how disease states like Parkinson's disease affect the striatal circuitry.
Collapse
Affiliation(s)
- D James Surmeier
- Department of Physiology, Northwestern University, Feinberg School of Medicine, Chicago, IL 60611, USA
| | | | | |
Collapse
|
142
|
Frank MJ, Surmeier DJ. Do Substantia Nigra Dopaminergic Neurons Differentiate Between Reward and Punishment? J Mol Cell Biol 2009; 1:15-6. [DOI: 10.1093/jmcb/mjp010] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
|
143
|
Abstract
Dopaminergic neurons of the substantia nigra pars compacta are autonomous pacemakers. This activity is responsible for the sustained release of dopamine necessary for the proper functioning of target structures, such as the striatum. Somatodendritic L-type Ca2+ channels have long been viewed as important, if not necessary, for this activity. The studies reported here challenge this viewpoint. Using a combination of optical and electrophysiological approaches in brain slices, it was found that antagonism of L-type Ca2+ channel effectively stopped dendritic Ca2+ oscillations but left autonomous pacemaking unchanged. Moreover, damping intracellular Ca2+ oscillations with exogenous buffer had little effect on pacemaking rate. Although not necessary for pacemaking, L-type channels helped support pacemaking when challenged with cationic channel blockers. Simulations suggested that the insensitivity to antagonism of L-type channels reflected the multichannel nature of the pacemaking process. The robustness of pacemaking underscores its biological importance and provides a framework for understanding how therapeutics targeting L-type Ca2+ channels might protect dopaminergic neurons in Parkinson's disease without compromising their function.
Collapse
|
144
|
Frank MJ, Hutchison K. Genetic contributions to avoidance-based decisions: striatal D2 receptor polymorphisms. Neuroscience 2009; 164:131-40. [PMID: 19393722 DOI: 10.1016/j.neuroscience.2009.04.048] [Citation(s) in RCA: 102] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2008] [Revised: 03/31/2009] [Accepted: 04/02/2009] [Indexed: 10/20/2022]
Abstract
Individuals differ in their tendencies to seek positive decision outcomes or to avoid negative ones. At the neurobiological level, our model suggests that phasic changes in dopamine support learning to reinforce good decisions via striatal D1 receptors, and to avoid maladaptive choices via striatal D2 receptors. Accordingly, in a previous study individual differences in positive and negative learning were strongly modulated by two genetic polymorphisms factors related to striatal D1 and D2 function, respectively. Nevertheless, whereas the role for dopamine in positive learning is relatively well accepted, that in learning to avoid negative outcomes is more controversial. Here we further explore D2-receptor-related genetic contributions to probabilistic avoidance in humans, in light of recent data showing that particular DRD2 polymorphisms are associated with functional modulation of receptor expression [Zhang Y, Bertolino A, Fazio L, Blasi G, Rampino A, Romano R, Lee M-LT, Xiao T, Papp A, Wang D, Sadée W (2007) Polymorphisms in human dopamine d2 receptor gene affect gene expression, splicing, and neuronal activity during working memory. Proc Natl Acad Sci U S A 104(51):20552-20557]. We find that a promoter polymorphism rs12364283 associated with transcription and D2 receptor density was strongly and selectively predictive of avoidance-based decisions. Two further polymorphisms (rs2283265 and rs1076560) associated with relatively reduced presynaptic relative to postsynaptic D2 receptor expression were predictive of relative impairments in negative compared to positive decisions. These previously undocumented effects of DRD2 polymorphisms were largely independent of those we reported previously for the C957T polymorphism (rs6277) associated with striatal D2 density. In contrast, effects of the commonly studied Taq1A polymorphism on reinforcement-based decisions were due to indirect association with C957T. Taken together these findings suggest multiple D2-dependent genetic mechanisms contributing to avoidance. We discuss these effects in the context of neurocomputational models of reinforcement leaning in the basal ganglia.
Collapse
Affiliation(s)
- M J Frank
- Department of Cognitive & Linguistic Sciences, Brown Institute for Brain Science, Brown University, Providence, RI 02912, USA.
| | | |
Collapse
|