1
|
Mahmood S, Teo C, Sim J, Zhang W, Muyun J, Bhuvana R, Teo K, Yeo TT, Lu J, Gulyas B, Guan C. The application of eXplainable artificial intelligence in studying cognition: A scoping review. IBRAIN 2024; 10:245-265. [PMID: 39346792 PMCID: PMC11427810 DOI: 10.1002/ibra.12174] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/17/2024] [Revised: 08/12/2024] [Accepted: 08/19/2024] [Indexed: 10/01/2024]
Abstract
The rapid advancement of artificial intelligence (AI) has sparked renewed discussions on its trustworthiness and the concept of eXplainable AI (XAI). Recent research in neuroscience has emphasized the relevance of XAI in studying cognition. This scoping review aims to identify and analyze various XAI methods used to study the mechanisms and features of cognitive function and dysfunction. In this study, the collected evidence is qualitatively assessed to develop an effective framework for approaching XAI in cognitive neuroscience. Based on the Joanna Briggs Institute and preferred reporting items for systematic reviews and meta-analyses extension for scoping review guidelines, we searched for peer-reviewed articles on MEDLINE, Embase, Web of Science, Cochrane Central Register of Controlled Trials, and Google Scholar. Two reviewers performed data screening, extraction, and thematic analysis in parallel. Twelve eligible experimental studies published in the past decade were included. The results showed that the majority (75%) focused on normal cognitive functions such as perception, social cognition, language, executive function, and memory, while others (25%) examined impaired cognition. The predominant XAI methods employed were intrinsic XAI (58.3%), followed by attribution-based (41.7%) and example-based (8.3%) post hoc methods. Explainability was applied at a local (66.7%) or global (33.3%) scope. The findings, predominantly correlational, were anatomical (83.3%) or nonanatomical (16.7%). In conclusion, while these XAI techniques were lauded for their predictive power, robustness, testability, and plausibility, limitations included oversimplification, confounding factors, and inconsistencies. The reviewed studies showcased the potential of XAI models while acknowledging current challenges in causality and oversimplification, particularly emphasizing the need for reproducibility.
Collapse
Affiliation(s)
- Shakran Mahmood
- Lee Kong Chian School of MedicineNanyang Technological UniversitySingaporeSingapore
| | - Colin Teo
- Lee Kong Chian School of MedicineNanyang Technological UniversitySingaporeSingapore
- Centre for Neuroimaging ResearchNanyang Technological UniversitySingaporeSingapore
- Division of Neurosurgery, Department of SurgeryNational University HospitalSingaporeSingapore
| | - Jeremy Sim
- Centre for Neuroimaging ResearchNanyang Technological UniversitySingaporeSingapore
| | - Wei Zhang
- Lee Kong Chian School of MedicineNanyang Technological UniversitySingaporeSingapore
- Centre for Neuroimaging ResearchNanyang Technological UniversitySingaporeSingapore
| | - Jiang Muyun
- Centre for Neuroimaging ResearchNanyang Technological UniversitySingaporeSingapore
- School of Computer Science and EngineeringNanyang Technological UniversitySingaporeSingapore
| | - R. Bhuvana
- Lee Kong Chian School of MedicineNanyang Technological UniversitySingaporeSingapore
- Centre for Neuroimaging ResearchNanyang Technological UniversitySingaporeSingapore
| | - Kejia Teo
- Division of Neurosurgery, Department of SurgeryNational University HospitalSingaporeSingapore
| | - Tseng Tsai Yeo
- Division of Neurosurgery, Department of SurgeryNational University HospitalSingaporeSingapore
| | - Jia Lu
- Defence Medical and Environmental Research InstituteDSO National LaboratoriesSingaporeSingapore
| | - Balazs Gulyas
- Lee Kong Chian School of MedicineNanyang Technological UniversitySingaporeSingapore
- Centre for Neuroimaging ResearchNanyang Technological UniversitySingaporeSingapore
| | - Cuntai Guan
- Centre for Neuroimaging ResearchNanyang Technological UniversitySingaporeSingapore
- School of Computer Science and EngineeringNanyang Technological UniversitySingaporeSingapore
| |
Collapse
|
2
|
Dura-Bernal S, Neymotin SA, Suter BA, Dacre J, Moreira JVS, Urdapilleta E, Schiemann J, Duguid I, Shepherd GMG, Lytton WW. Multiscale model of primary motor cortex circuits predicts in vivo cell-type-specific, behavioral state-dependent dynamics. Cell Rep 2023; 42:112574. [PMID: 37300831 PMCID: PMC10592234 DOI: 10.1016/j.celrep.2023.112574] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2022] [Revised: 02/27/2023] [Accepted: 05/12/2023] [Indexed: 06/12/2023] Open
Abstract
Understanding cortical function requires studying multiple scales: molecular, cellular, circuit, and behavioral. We develop a multiscale, biophysically detailed model of mouse primary motor cortex (M1) with over 10,000 neurons and 30 million synapses. Neuron types, densities, spatial distributions, morphologies, biophysics, connectivity, and dendritic synapse locations are constrained by experimental data. The model includes long-range inputs from seven thalamic and cortical regions and noradrenergic inputs. Connectivity depends on cell class and cortical depth at sublaminar resolution. The model accurately predicts in vivo layer- and cell-type-specific responses (firing rates and LFP) associated with behavioral states (quiet wakefulness and movement) and experimental manipulations (noradrenaline receptor blockade and thalamus inactivation). We generate mechanistic hypotheses underlying the observed activity and analyzed low-dimensional population latent dynamics. This quantitative theoretical framework can be used to integrate and interpret M1 experimental data and sheds light on the cell-type-specific multiscale dynamics associated with several experimental conditions and behaviors.
Collapse
Affiliation(s)
- Salvador Dura-Bernal
- Department of Physiology and Pharmacology, State University of New York (SUNY) Downstate Health Sciences University, Brooklyn, NY, USA; Center for Biomedical Imaging and Neuromodulation, Nathan S. Kline Institute for Psychiatric Research, Orangeburg, NY, USA.
| | - Samuel A Neymotin
- Center for Biomedical Imaging and Neuromodulation, Nathan S. Kline Institute for Psychiatric Research, Orangeburg, NY, USA; Department of Psychiatry, Grossman School of Medicine, New York University (NYU), New York, NY, USA
| | - Benjamin A Suter
- Department of Physiology, Northwestern University, Evanston, IL, USA
| | - Joshua Dacre
- Centre for Discovery Brain Sciences, Edinburgh Medical School: Biomedical Sciences, University of Edinburgh, Edinburgh, UK
| | - Joao V S Moreira
- Department of Physiology and Pharmacology, State University of New York (SUNY) Downstate Health Sciences University, Brooklyn, NY, USA
| | - Eugenio Urdapilleta
- Department of Physiology and Pharmacology, State University of New York (SUNY) Downstate Health Sciences University, Brooklyn, NY, USA
| | - Julia Schiemann
- Centre for Discovery Brain Sciences, Edinburgh Medical School: Biomedical Sciences, University of Edinburgh, Edinburgh, UK; Center for Integrative Physiology and Molecular Medicine, Saarland University, Saarbrücken, Germany
| | - Ian Duguid
- Centre for Discovery Brain Sciences, Edinburgh Medical School: Biomedical Sciences, University of Edinburgh, Edinburgh, UK
| | | | - William W Lytton
- Department of Physiology and Pharmacology, State University of New York (SUNY) Downstate Health Sciences University, Brooklyn, NY, USA; Aligning Science Across Parkinson's (ASAP) Collaborative Research Network, Chevy Chase, MD, USA; Department of Neurology, Kings County Hospital Center, Brooklyn, NY, USA
| |
Collapse
|
3
|
Weiss AR, Korzeniewska A, Chrabaszcz A, Bush A, Fiez JA, Crone NE, Richardson RM. Lexicality-Modulated Influence of Auditory Cortex on Subthalamic Nucleus During Motor Planning for Speech. NEUROBIOLOGY OF LANGUAGE (CAMBRIDGE, MASS.) 2023; 4:53-80. [PMID: 37229140 PMCID: PMC10205077 DOI: 10.1162/nol_a_00086] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/09/2022] [Accepted: 10/18/2022] [Indexed: 05/27/2023]
Abstract
Speech requires successful information transfer within cortical-basal ganglia loop circuits to produce the desired acoustic output. For this reason, up to 90% of Parkinson's disease patients experience impairments of speech articulation. Deep brain stimulation (DBS) is highly effective in controlling the symptoms of Parkinson's disease, sometimes alongside speech improvement, but subthalamic nucleus (STN) DBS can also lead to decreases in semantic and phonological fluency. This paradox demands better understanding of the interactions between the cortical speech network and the STN, which can be investigated with intracranial EEG recordings collected during DBS implantation surgery. We analyzed the propagation of high-gamma activity between STN, superior temporal gyrus (STG), and ventral sensorimotor cortices during reading aloud via event-related causality, a method that estimates strengths and directionalities of neural activity propagation. We employed a newly developed bivariate smoothing model based on a two-dimensional moving average, which is optimal for reducing random noise while retaining a sharp step response, to ensure precise embedding of statistical significance in the time-frequency space. Sustained and reciprocal neural interactions between STN and ventral sensorimotor cortex were observed. Moreover, high-gamma activity propagated from the STG to the STN prior to speech onset. The strength of this influence was affected by the lexical status of the utterance, with increased activity propagation during word versus pseudoword reading. These unique data suggest a potential role for the STN in the feedforward control of speech.
Collapse
Affiliation(s)
- Alexander R. Weiss
- JHU Cognitive Neurophysiology and BMI Lab, Department of Neurology, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Anna Korzeniewska
- JHU Cognitive Neurophysiology and BMI Lab, Department of Neurology, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Anna Chrabaszcz
- Department of Psychology, University of Pittsburgh, Pittsburgh, PA, USA
| | - Alan Bush
- Brain Modulation Lab, Department of Neurosurgery, Massachusetts General Hospital, Boston, MA, USA
- Harvard Medical School, Boston, MA, USA
| | - Julie A. Fiez
- Department of Psychology, University of Pittsburgh, Pittsburgh, PA, USA
- Department of Communication Science and Disorders, University of Pittsburgh, Pittsburgh, PA, USA
- University of Pittsburgh Brain Institute, Pittsburgh, PA, USA
| | - Nathan E. Crone
- JHU Cognitive Neurophysiology and BMI Lab, Department of Neurology, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Robert M. Richardson
- Brain Modulation Lab, Department of Neurosurgery, Massachusetts General Hospital, Boston, MA, USA
- Harvard Medical School, Boston, MA, USA
| |
Collapse
|
4
|
Gilbertson T, Steele D. Tonic dopamine, uncertainty and basal ganglia action selection. Neuroscience 2021; 466:109-124. [PMID: 34015370 DOI: 10.1016/j.neuroscience.2021.05.010] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2020] [Revised: 05/04/2021] [Accepted: 05/08/2021] [Indexed: 11/29/2022]
Abstract
To make optimal decisions in uncertain circumstances flexible adaption of behaviour is required; exploring alternatives when the best choice is unknown, exploiting what is known when that is best. Using a computational model of the basal ganglia, we propose that switches between exploratory and exploitative decisions are mediated by the interaction between tonic dopamine and cortical input to the basal ganglia. We show that a biologically detailed action selection circuit model, endowed with dopamine dependant striatal plasticity, can optimally solve the explore-exploit problem, estimating the true underlying state of a noisy Gaussian diffusion process. Critical to the model's performance was a fluctuating level of tonic dopamine which increased under conditions of uncertainty. With an optimal range of tonic dopamine, explore-exploit decisions were mediated by the effects of tonic dopamine on the precision of the model action selection mechanism. Under conditions of uncertain reward pay-out, the model's reduced selectivity allowed disinhibition of multiple alternative actions to be explored at random. Conversely, when uncertainly about reward pay-out was low, enhanced selectivity of the action selection circuit facilitated exploitation of the high value choice. Model performance was at the level of a Kalman filter which provides an optimal solution for the task. These simulations support the idea that this subcortical neural circuit may have evolved to facilitate decision making in non-stationary reward environments. The model generates several experimental predictions with relevance to abnormal decision making in neuropsychiatric and neurological disease.
Collapse
Affiliation(s)
- Tom Gilbertson
- Department of Neurology, Level 6, South Block, Ninewells Hospital & Medical School, Dundee DD2 4BF, UK; Division of Imaging Science and Technology, Medical School, University of Dundee, DD2 4BF, UK.
| | - Douglas Steele
- Division of Imaging Science and Technology, Medical School, University of Dundee, DD2 4BF, UK
| |
Collapse
|
5
|
Balice-Gordon R, Honey GD, Chatham C, Arce E, Duvvuri S, Naylor MG, Liu W, Xie Z, DeMartinis N, Harel BT, Braley GH, Kozak R, Park L, Gray DL. A Neurofunctional Domains Approach to Evaluate D1/D5 Dopamine Receptor Partial Agonism on Cognition and Motivation in Healthy Volunteers With Low Working Memory Capacity. Int J Neuropsychopharmacol 2020; 23:287-299. [PMID: 32055822 PMCID: PMC7251631 DOI: 10.1093/ijnp/pyaa007] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/24/2019] [Revised: 01/02/2020] [Accepted: 02/12/2020] [Indexed: 12/21/2022] Open
Abstract
BACKGROUND Dopamine D1 receptor signaling plays key roles in core domains of neural function, including cognition and reward processing; however, many questions remain about the functions of circuits modulated by dopamine D1 receptor, largely because clinically viable, selective agonists have yet to be tested in humans. METHODS Using a novel, exploratory neurofunctional domains study design, we assessed the safety, tolerability, pharmacodynamics, and pharmacokinetics of PF-06412562, a selective D1/D5R partial agonist, in healthy male volunteers who met prespecified criteria for low working memory capacity. Functional magnetic resonance imaging, electrophysiologic endpoints, and behavioral paradigms were used to assess working memory, executive function, and motivation/reward processing following multiple-dose administration of PF-06412562. A total of 77 patients were assigned PF-06412562 (3 mg twice daily and 15 mg twice daily) or placebo administered for 5 to 7 days. Due to the exploratory nature of the study, it was neither powered for any specific treatment effect nor corrected for multiple comparisons. RESULTS Nominally significant improvements from baseline in cognitive endpoints were observed in all 3 groups; however, improvements in PF-06412562-treated patients were less than in placebo-treated participants. Motivation/reward processing endpoints were variable. PF-06412562 was safe and well tolerated, with no serious adverse events, severe adverse events, or adverse events leading to dose reduction or temporary discontinuation except for 1 permanent discontinuation due to increased orthostatic heart rate. CONCLUSIONS PF-06412562, in the dose range and patient population explored in this study, did not improve cognitive function or motivation/reward processing more than placebo over the 5- to 7-day treatment period. CLINICALTRIALS.GOV IDENTIFIER NCT02306876.
Collapse
Affiliation(s)
| | - Garry D Honey
- Pfizer Worldwide Research and Development, Cambridge, MA
| | | | - Estibaliz Arce
- Pfizer Worldwide Research and Development, Cambridge, MA
| | | | | | - Wenlei Liu
- Pfizer Worldwide Research and Development, Cambridge, MA
| | - Zhiyong Xie
- Pfizer Worldwide Research and Development, Cambridge, MA
| | | | - Brian T Harel
- Pfizer Worldwide Research and Development, Cambridge, MA
| | | | - Rouba Kozak
- Pfizer Worldwide Research and Development, Cambridge, MA
| | - Lovingly Park
- California Clinical Trials Medical Group/PAREXEL International, Glendale, CA
| | - David L Gray
- Pfizer Worldwide Research and Development, Cambridge, MA
| |
Collapse
|
6
|
Morita K, Kawaguchi Y. A Dual Role Hypothesis of the Cortico-Basal-Ganglia Pathways: Opponency and Temporal Difference Through Dopamine and Adenosine. Front Neural Circuits 2019; 12:111. [PMID: 30687019 PMCID: PMC6338031 DOI: 10.3389/fncir.2018.00111] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2018] [Accepted: 11/29/2018] [Indexed: 01/07/2023] Open
Abstract
The hypothesis that the basal-ganglia direct and indirect pathways represent goodness (or benefit) and badness (or cost) of options, respectively, explains a wide range of phenomena. However, this hypothesis, named the Opponent Actor Learning (OpAL), still has limitations. Structurally, the OpAL model does not incorporate differentiation of the two types of cortical inputs to the basal-ganglia pathways received from intratelencephalic (IT) and pyramidal-tract (PT) neurons. Functionally, the OpAL model does not describe the temporal-difference (TD)-type reward-prediction-error (RPE), nor explains how RPE is calculated in the circuitry connecting to the DA neurons. In fact, there is a different hypothesis on the basal-ganglia pathways and DA, named the Cortico-Striatal-Temporal-Difference (CS-TD) model. The CS-TD model differentiates the IT and PT inputs, describes the TD-type RPE, and explains how TD-RPE is calculated. However, a critical difficulty in this model lies in its assumption that DA induces the same direction of plasticity in both direct and indirect pathways, which apparently contradicts the experimentally observed opposite effects of DA on these pathways. Here, we propose a new hypothesis that integrates the OpAL and CS-TD models. Specifically, we propose that the IT-basal-ganglia pathways represent goodness/badness of current options while the PT-indirect pathway represents the overall value of the previously chosen option, and both of these have influence on the DA neurons, through the basal-ganglia output, so that a variant of TD-RPE is calculated. A key assumption is that opposite directions of plasticity are induced upon phasic activation of DA neurons in the IT-indirect pathway and PT-indirect pathway because of different profiles of IT and PT inputs. Specifically, at PT→indirect-pathway-medium-spiny-neuron (iMSN) synapses, sustained glutamatergic inputs generate rich adenosine, which allosterically prevents DA-D2 receptor signaling and instead favors adenosine-A2A receptor signaling. Then, phasic DA-induced phasic adenosine, which reflects TD-RPE, causes long-term synaptic potentiation. In contrast, at IT→iMSN synapses where adenosine is scarce, phasic DA causes long-term synaptic depression via D2 receptor signaling. This new Opponency and Temporal-Difference (OTD) model provides unique predictions, part of which is potentially in line with recently reported activity patterns of neurons in the globus pallidus externus on the indirect pathway.
Collapse
Affiliation(s)
- Kenji Morita
- Physical and Health Education, Graduate School of Education, The University of Tokyo, Tokyo, Japan.,International Research Center for Neurointelligence (WPI-IRCN), The University of Tokyo Institutes for Advanced Study, Tokyo, Japan
| | - Yasuo Kawaguchi
- Division of Cerebral Circuitry, National Institute for Physiological Sciences, Okazaki, Japan.,Department of Physiological Sciences, Graduate University for Advanced Studies, Okazaki, Japan
| |
Collapse
|
7
|
Jeanneteau F, Borie A, Chao MV, Garabedian MJ. Bridging the Gap between Brain-Derived Neurotrophic Factor and Glucocorticoid Effects on Brain Networks. Neuroendocrinology 2019; 109:277-284. [PMID: 30572337 DOI: 10.1159/000496392] [Citation(s) in RCA: 31] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/12/2018] [Accepted: 12/19/2018] [Indexed: 11/19/2022]
Abstract
Behavioral choices made by the brain during stress depend on glucocorticoid and brain-derived neurotrophic factor (BDNF) signaling pathways acting in synchrony in the mesolimbic (reward) and corticolimbic (emotion) neural networks. Deregulated expression of BDNF and glucocorticoid receptors in brain valuation areas may compromise the integration of signals. Glucocorticoid receptor phosphorylation upon BDNF signaling in neurons represents one mechanism underlying the integration of BDNF and glucocorticoid signals that when off balance may lay the foundation of maladaptations to stress. Here, we propose that BDNF signaling conditions glucocorticoid responses impacting neural plasticity in the mesocorticolimbic system. This provides a novel molecular framework for understanding how brain networks use BDNF and glucocorticoid signaling contingencies to forge receptive neuronal fields in temporal domains defined by behavioral experience, and in mood disorders.
Collapse
Affiliation(s)
- Freddy Jeanneteau
- Institut de Genomique Fonctionnelle, Inserm, CNRS, University of Montpellier, Montpellier, France,
| | - Amélie Borie
- Institut de Genomique Fonctionnelle, Inserm, CNRS, University of Montpellier, Montpellier, France
| | - Moses V Chao
- Skirball Institute of Biomolecular Medicine, New York University Langone Medical Center, New York, New York, USA
| | | |
Collapse
|
8
|
Kawaguchi Y. Pyramidal Cell Subtypes and Their Synaptic Connections in Layer 5 of Rat Frontal Cortex. Cereb Cortex 2018; 27:5755-5771. [PMID: 29028949 DOI: 10.1093/cercor/bhx252] [Citation(s) in RCA: 57] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2017] [Accepted: 09/06/2017] [Indexed: 12/31/2022] Open
Abstract
The frontal cortical areas make a coordinated response that generates appropriate behavior commands, using individual local circuits with corticostriatal and corticocortical connections in longer time scales than sensory areas. In secondary motor cortex (M2), situated between the prefrontal and primary motor areas, major subtypes of layer 5 corticostriatal cells are crossed-corticostriatal (CCS) cells innervating both sides of striatum, and corticopontine (CPn) cells projecting to the ipsilateral striatum and pontine nuclei. CCS cells innervate CPn cells unidirectionally: the former are therefore hierarchically higher than the latter among L5 corticostriatal cells. CCS cells project directly to both frontal and nonfrontal areas. On the other hand, CPn cells innervate the thalamus and layer 1a of frontal areas, where thalamic fibers relaying basal ganglia outputs are distributed. Thus, CCS cells can make activities of frontal areas in concert with those of nonfrontal area using corticocortical loops, whereas CPn cells are more involved in closed corticostriatal loops than CCS cells. Since reciprocal connections between CPn cells with facilitatory synapses may be related to persistent activity, CPn cells play a key role of longer time constant processes in corticostriatal as well as in corticocortical loops between the frontal areas.
Collapse
Affiliation(s)
- Yasuo Kawaguchi
- Division of Cerebral Circuitry, National Institute for Physiological Sciences, Okazaki 444-8787, Japan.,Department of Physiological Sciences, SOKENDAI (Graduate University for Advanced Studies), Okazaki, Japan
| |
Collapse
|
9
|
A Neural Circuit Mechanism for the Involvements of Dopamine in Effort-Related Choices: Decay of Learned Values, Secondary Effects of Depletion, and Calculation of Temporal Difference Error. eNeuro 2018; 5:eN-NWR-0021-18. [PMID: 29468191 PMCID: PMC5820541 DOI: 10.1523/eneuro.0021-18.2018] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2018] [Accepted: 01/11/2018] [Indexed: 12/17/2022] Open
Abstract
Dopamine has been suggested to be crucially involved in effort-related choices. Key findings are that dopamine depletion (i) changed preference for a high-cost, large-reward option to a low-cost, small-reward option, (ii) but not when the large-reward option was also low-cost or the small-reward option gave no reward, (iii) while increasing the latency in all the cases but only transiently, and (iv) that antagonism of either dopamine D1 or D2 receptors also specifically impaired selection of the high-cost, large-reward option. The underlying neural circuit mechanisms remain unclear. Here we show that findings i–iii can be explained by the dopaminergic representation of temporal-difference reward-prediction error (TD-RPE), whose mechanisms have now become clarified, if (1) the synaptic strengths storing the values of actions mildly decay in time and (2) the obtained-reward-representing excitatory input to dopamine neurons increases after dopamine depletion. The former is potentially caused by background neural activity–induced weak synaptic plasticity, and the latter is assumed to occur through post-depletion increase of neural activity in the pedunculopontine nucleus, where neurons representing obtained reward exist and presumably send excitatory projections to dopamine neurons. We further show that finding iv, which is nontrivial given the suggested distinct functions of the D1 and D2 corticostriatal pathways, can also be explained if we additionally assume a proposed mechanism of TD-RPE calculation, in which the D1 and D2 pathways encode the values of actions with a temporal difference. These results suggest a possible circuit mechanism for the involvements of dopamine in effort-related choices and, simultaneously, provide implications for the mechanisms of TD-RPE calculation.
Collapse
|
10
|
Héricé C, Khalil R, Moftah M, Boraud T, Guthrie M, Garenne A. Decision making under uncertainty in a spiking neural network model of the basal ganglia. J Integr Neurosci 2016; 15:515-538. [PMID: 28002987 DOI: 10.1142/s021963521650028x] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
The mechanisms of decision-making and action selection are generally thought to be under the control of parallel cortico-subcortical loops connecting back to distinct areas of cortex through the basal ganglia and processing motor, cognitive and limbic modalities of decision-making. We have used these properties to develop and extend a connectionist model at a spiking neuron level based on a previous rate model approach. This model is demonstrated on decision-making tasks that have been studied in primates and the electrophysiology interpreted to show that the decision is made in two steps. To model this, we have used two parallel loops, each of which performs decision-making based on interactions between positive and negative feedback pathways. This model is able to perform two-level decision-making as in primates. We show here that, before learning, synaptic noise is sufficient to drive the decision-making process and that, after learning, the decision is based on the choice that has proven most likely to be rewarded. The model is then submitted to lesion tests, reversal learning and extinction protocols. We show that, under these conditions, it behaves in a consistent manner and provides predictions in accordance with observed experimental data.
Collapse
Affiliation(s)
- Charlotte Héricé
- * University de Bordeaux, Institut des Maladies Neurodégénératives, UMR 5293, 33000 Bordeaux, France.,† CNRS, Institut des Maladies Neurodégénératives, UMR 5293, 33000 Bordeaux, France
| | - Radwa Khalil
- † CNRS, Institut des Maladies Neurodégénératives, UMR 5293, 33000 Bordeaux, France
| | | | - Thomas Boraud
- * University de Bordeaux, Institut des Maladies Neurodégénératives, UMR 5293, 33000 Bordeaux, France.,† CNRS, Institut des Maladies Neurodégénératives, UMR 5293, 33000 Bordeaux, France
| | - Martin Guthrie
- * University de Bordeaux, Institut des Maladies Neurodégénératives, UMR 5293, 33000 Bordeaux, France.,† CNRS, Institut des Maladies Neurodégénératives, UMR 5293, 33000 Bordeaux, France
| | - André Garenne
- * University de Bordeaux, Institut des Maladies Neurodégénératives, UMR 5293, 33000 Bordeaux, France.,† CNRS, Institut des Maladies Neurodégénératives, UMR 5293, 33000 Bordeaux, France
| |
Collapse
|
11
|
Striatal Activity and Reward Relativity: Neural Signals Encoding Dynamic Outcome Valuation. eNeuro 2016; 3:eN-NWR-0022-16. [PMID: 27822506 PMCID: PMC5089537 DOI: 10.1523/eneuro.0022-16.2016] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2016] [Revised: 10/06/2016] [Accepted: 10/07/2016] [Indexed: 11/21/2022] Open
Abstract
The striatum is a key brain region involved in reward processing. Striatal activity has been linked to encoding reward magnitude and integrating diverse reward outcome information. Recent work has supported the involvement of striatum in the valuation of outcomes. The present work extends this idea by examining striatal activity during dynamic shifts in value that include different levels and directions of magnitude disparity. A novel task was used to produce diverse relative reward effects on a chain of instrumental action. Rats (Rattus norvegicus) were trained to respond to cues associated with specific outcomes varying by food pellet magnitude. Animals were exposed to single-outcome sessions followed by mixed-outcome sessions, and neural activity was compared among identical outcome trials from the different behavioral contexts. Results recording striatal activity show that neural responses to different task elements reflect incentive contrast as well as other relative effects that involve generalization between outcomes or possible influences of outcome variety. The activity that was most prevalent was linked to food consumption and post-food consumption periods. Relative encoding was sensitive to magnitude disparity. A within-session analysis showed strong contrast effects that were dependent upon the outcome received in the immediately preceding trial. Significantly higher numbers of responses were found in ventral striatum linked to relative outcome effects. Our results support the idea that relative value can incorporate diverse relationships, including comparisons from specific individual outcomes to general behavioral contexts. The striatum contains these diverse relative processes, possibly enabling both a higher information yield concerning value shifts and a greater behavioral flexibility.
Collapse
|
12
|
Kato A, Morita K. Forgetting in Reinforcement Learning Links Sustained Dopamine Signals to Motivation. PLoS Comput Biol 2016; 12:e1005145. [PMID: 27736881 PMCID: PMC5063413 DOI: 10.1371/journal.pcbi.1005145] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2016] [Accepted: 09/14/2016] [Indexed: 12/12/2022] Open
Abstract
It has been suggested that dopamine (DA) represents reward-prediction-error (RPE) defined in reinforcement learning and therefore DA responds to unpredicted but not predicted reward. However, recent studies have found DA response sustained towards predictable reward in tasks involving self-paced behavior, and suggested that this response represents a motivational signal. We have previously shown that RPE can sustain if there is decay/forgetting of learned-values, which can be implemented as decay of synaptic strengths storing learned-values. This account, however, did not explain the suggested link between tonic/sustained DA and motivation. In the present work, we explored the motivational effects of the value-decay in self-paced approach behavior, modeled as a series of ‘Go’ or ‘No-Go’ selections towards a goal. Through simulations, we found that the value-decay can enhance motivation, specifically, facilitate fast goal-reaching, albeit counterintuitively. Mathematical analyses revealed that underlying potential mechanisms are twofold: (1) decay-induced sustained RPE creates a gradient of ‘Go’ values towards a goal, and (2) value-contrasts between ‘Go’ and ‘No-Go’ are generated because while chosen values are continually updated, unchosen values simply decay. Our model provides potential explanations for the key experimental findings that suggest DA's roles in motivation: (i) slowdown of behavior by post-training blockade of DA signaling, (ii) observations that DA blockade severely impairs effortful actions to obtain rewards while largely sparing seeking of easily obtainable rewards, and (iii) relationships between the reward amount, the level of motivation reflected in the speed of behavior, and the average level of DA. These results indicate that reinforcement learning with value-decay, or forgetting, provides a parsimonious mechanistic account for the DA's roles in value-learning and motivation. Our results also suggest that when biological systems for value-learning are active even though learning has apparently converged, the systems might be in a state of dynamic equilibrium, where learning and forgetting are balanced. Dopamine (DA) has been suggested to have two reward-related roles: (1) representing reward-prediction-error (RPE), and (2) providing motivational drive. Role(1) is based on the physiological results that DA responds to unpredicted but not predicted reward, whereas role(2) is supported by the pharmacological results that blockade of DA signaling causes motivational impairments such as slowdown of self-paced behavior. So far, these two roles are considered to be played by two different temporal patterns of DA signals: role(1) by phasic signals and role(2) by tonic/sustained signals. However, recent studies have found sustained DA signals with features indicative of both roles (1) and (2), complicating this picture. Meanwhile, whereas synaptic/circuit mechanisms for role(1), i.e., how RPE is calculated in the upstream of DA neurons and how RPE-dependent update of learned-values occurs through DA-dependent synaptic plasticity, have now become clarified, mechanisms for role(2) remain unclear. In this work, we modeled self-paced behavior by a series of ‘Go’ or ‘No-Go’ selections in the framework of reinforcement-learning assuming DA's role(1), and demonstrated that incorporation of decay/forgetting of learned-values, which is presumably implemented as decay of synaptic strengths storing learned-values, provides a potential unified mechanistic account for the DA's two roles, together with its various temporal patterns.
Collapse
Affiliation(s)
- Ayaka Kato
- Department of Biological Sciences, Graduate School of Science, The University of Tokyo, Tokyo, Japan
| | - Kenji Morita
- Physical and Health Education, Graduate School of Education, The University of Tokyo, Tokyo, Japan
- * E-mail:
| |
Collapse
|