1
|
Whelan MT, Jimenez-Rodriguez A, Prescott TJ, Vasilaki E. A robotic model of hippocampal reverse replay for reinforcement learning. BIOINSPIRATION & BIOMIMETICS 2022; 18:015007. [PMID: 36327454 DOI: 10.1088/1748-3190/ac9ffc] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/17/2021] [Accepted: 11/03/2022] [Indexed: 06/16/2023]
Abstract
Hippocampal reverse replay, a phenomenon in which recently active hippocampal cells reactivate in the reverse order, is thought to contribute to learning, particularly reinforcement learning (RL), in animals. Here, we present a novel computational model which exploits reverse replay to improve stability and performance on a homing task. The model takes inspiration from the hippocampal-striatal network, and learning occurs via a three-factor RL rule. To augment this model with hippocampal reverse replay, we derived a policy gradient learning rule that associates place-cell activity with responses in cells representing actions and a supervised learning rule of the same form, interpreting the replay activity as a 'target' frequency. We evaluated the model using a simulated robot spatial navigation task inspired by the Morris water maze. Results suggest that reverse replay can improve performance stability over multiple trials. Our model exploits reverse reply as an additional source for propagating information about desirable synaptic changes, reducing the requirements for long-time scales in eligibility traces combined with low learning rates. We conclude that reverse replay can positively contribute to RL, although less stable learning is possible in its absence. Analogously, we postulate that reverse replay may enhance RL in the mammalian hippocampal-striatal system rather than provide its core mechanism.
Collapse
Affiliation(s)
- Matthew T Whelan
- Department of Computer Science, The University of Sheffield, Sheffield, United Kingdom
- Sheffield Robotics, Sheffield, United Kingdom
| | - Alejandro Jimenez-Rodriguez
- Department of Computer Science, The University of Sheffield, Sheffield, United Kingdom
- Sheffield Robotics, Sheffield, United Kingdom
| | - Tony J Prescott
- Department of Computer Science, The University of Sheffield, Sheffield, United Kingdom
- Sheffield Robotics, Sheffield, United Kingdom
| | - Eleni Vasilaki
- Department of Computer Science, The University of Sheffield, Sheffield, United Kingdom
- Sheffield Robotics, Sheffield, United Kingdom
| |
Collapse
|
2
|
Chen M, Liu F, Wen L, Hu X. Nonlinear relationship between CAN current and C a 2 + influx underpins synergistic action of muscarinic and NMDA receptors on bursts induction in midbrain dopaminergic neurons. Cogn Neurodyn 2022; 16:719-731. [PMID: 35603052 PMCID: PMC9120320 DOI: 10.1007/s11571-021-09740-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2021] [Revised: 10/08/2021] [Accepted: 10/19/2021] [Indexed: 12/29/2022] Open
Abstract
Bursting of midbrain dopamine (DA) neurons is believed to represent an important reward signal that instructs and reinforces goal-directed behaviors. In DA neurons, many afferents, including cholinergic and glutamatergic inputs, induce bursting, and it is suggested that a synergy exists between these afferents in bursting induction. However, the underlying mechanisms of the role and the synergy of muscarinic receptors (mAChRs) and NMDA receptors (NMDARs) in bursting induction remain unclear. Present work bestowed analysis using a mathematical model of DA neurons to demonstrate the underlying mechanisms. Activation of mAChRs, leading to rapid translocation of TRPC channels to cell surface, recruited C a 2 + -activated nonspecific (CAN) current ( I CAN ), meanwhile NMDARs excitation triggered C a 2 + influx, which induced the positive feedback loop of C a 2 + and I CAN , respectively, yielded a robust ramping depolarization with a superimposed high-frequency spiking. In some DA cells, neither NMDARs nor mAChRs induced positive feedback loop unless they were activated simultaneously to induce bursting. Our experimental results verified those theoretical findings. These together unveil the underlying mechanisms of the role and synergy of mAChRs and NMDARs in bursting induction emerge from the nonlinear relationship between C a 2 + influx and I CAN . Given the diverse and complex nature of neural circuitry and the DA neuron heterogeneity, our work provides new insights to understand specific afferents, the synergy between those afferents, and the differences in intrinsic excitability to be integrated by the bursting to accurately characterize the dopamine signals in the valances of reward and reinforcement, and a broad spectrum of neuropsychiatric disorders.
Collapse
Affiliation(s)
- Mengjiao Chen
- College of Life Sciences, Leshan Normal University, Leshan, 614000 China
- Key Laboratory of Sichuan Institute for Protecting Endangered Birds in the Southwest Mountains, Leshan Normal University, Leshan, 614000 China
- Key Laboratory of MOE for Modern Teaching Technology, Shaanxi Normal University, Xi’an, 710062 China
| | - Fangqing Liu
- College of Life Sciences, Leshan Normal University, Leshan, 614000 China
- Key Laboratory of Sichuan Institute for Protecting Endangered Birds in the Southwest Mountains, Leshan Normal University, Leshan, 614000 China
| | - Longying Wen
- College of Life Sciences, Leshan Normal University, Leshan, 614000 China
- Key Laboratory of Sichuan Institute for Protecting Endangered Birds in the Southwest Mountains, Leshan Normal University, Leshan, 614000 China
| | - Xia Hu
- College of Life Sciences, Leshan Normal University, Leshan, 614000 China
| |
Collapse
|
3
|
Khamassi M, Girard B. Modeling awake hippocampal reactivations with model-based bidirectional search. BIOLOGICAL CYBERNETICS 2020; 114:231-248. [PMID: 32065253 DOI: 10.1007/s00422-020-00817-x] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/30/2019] [Accepted: 01/21/2020] [Indexed: 06/10/2023]
Abstract
Hippocampal offline reactivations during reward-based learning, usually categorized as replay events, have been found to be important for performance improvement over time and for memory consolidation. Recent computational work has linked these phenomena to the need to transform reward information into state-action values for decision making and to propagate it to all relevant states of the environment. Nevertheless, it is still unclear whether an integrated reinforcement learning mechanism could account for the variety of awake hippocampal reactivations, including variety in order (forward and reverse reactivated trajectories) and variety in the location where they occur (reward site or decision-point). Here, we present a model-based bidirectional search model which accounts for a variety of hippocampal reactivations. The model combines forward trajectory sampling from current position and backward sampling through prioritized sweeping from states associated with large reward prediction errors until the two trajectories connect. This is repeated until stabilization of state-action values (convergence), which could explain why hippocampal reactivations drastically diminish when the animal's performance stabilizes. Simulations in a multiple T-maze task show that forward reactivations are prominently found at decision-points while backward reactivations are exclusively generated at reward sites. Finally, the model can generate imaginary trajectories that are not allowed to the agent during task performance. We raise some experimental predictions and implications for future studies of the role of the hippocampo-prefronto-striatal network in learning.
Collapse
Affiliation(s)
- Mehdi Khamassi
- Institute of Intelligent Systems and Robotics (ISIR), Sorbonne Université and CNRS (Centre National de la Recherche Scientifique), 75005, Paris, France.
| | - Benoît Girard
- Institute of Intelligent Systems and Robotics (ISIR), Sorbonne Université and CNRS (Centre National de la Recherche Scientifique), 75005, Paris, France
| |
Collapse
|
4
|
Leibold C. A model for navigation in unknown environments based on a reservoir of hippocampal sequences. Neural Netw 2020; 124:328-342. [DOI: 10.1016/j.neunet.2020.01.014] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2019] [Revised: 12/18/2019] [Accepted: 01/14/2020] [Indexed: 12/21/2022]
|
5
|
Liu X, Kuzum D. Hippocampal-Cortical Memory Trace Transfer and Reactivation Through Cell-Specific Stimulus and Spontaneous Background Noise. Front Comput Neurosci 2019; 13:67. [PMID: 31680922 PMCID: PMC6798041 DOI: 10.3389/fncom.2019.00067] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2019] [Accepted: 09/10/2019] [Indexed: 01/07/2023] Open
Abstract
The hippocampus plays important roles in memory formation and retrieval through sharp-wave-ripples. Recent studies have shown that certain neuron populations in the prefrontal cortex (PFC) exhibit coordinated reactivations during awake ripple events. These experimental findings suggest that the awake ripple is an important biomarker, through which the hippocampus interacts with the neocortex to assist memory formation and retrieval. However, the computational mechanisms of this ripple based hippocampal-cortical coordination are still not clear due to the lack of unified models that include both the hippocampal and cortical networks. In this work, using a coupled biophysical model of both CA1 and PFC, we investigate possible mechanisms of hippocampal-cortical memory trace transfer and the conditions that assist reactivation of the transferred memory traces in the PFC. To validate our model, we first show that the local field potentials generated in the hippocampus and PFC exhibit ripple range activities that are consistent with the recent experimental studies. Then we demonstrate that during ripples, sequence replays can successfully transfer the information stored in the hippocampus to the PFC recurrent networks. We investigate possible mechanisms of memory retrieval in PFC networks. Our results suggest that the stored memory traces in the PFC network can be retrieved through two different mechanisms, namely the cell-specific input representing external stimuli and non-specific spontaneous background noise representing spontaneous memory recall events. Importantly, in both cases, the memory reactivation quality is robust to network connection loss. Finally, we find out that the quality of sequence reactivations is enhanced by both increased number of SWRs and an optimal background noise intensity, which tunes the excitability of neurons to a proper level. Our study presents a mechanistic explanation for the memory trace transfer from the hippocampus to neocortex through ripple coupling in awake states and reports two different mechanisms by which the stored memory traces can be successfully retrieved.
Collapse
Affiliation(s)
- Xin Liu
- Department of Electrical and Computer Engineering, University of California, San Diego, San Diego, CA, United States
| | - Duygu Kuzum
- Department of Electrical and Computer Engineering, University of California, San Diego, San Diego, CA, United States
| |
Collapse
|
6
|
Drieu C, Zugaro M. Hippocampal Sequences During Exploration: Mechanisms and Functions. Front Cell Neurosci 2019; 13:232. [PMID: 31263399 PMCID: PMC6584963 DOI: 10.3389/fncel.2019.00232] [Citation(s) in RCA: 44] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2018] [Accepted: 05/08/2019] [Indexed: 12/13/2022] Open
Abstract
Although the hippocampus plays a critical role in spatial and episodic memories, the mechanisms underlying memory formation, stabilization, and recall for adaptive behavior remain relatively unknown. During exploration, within single cycles of the ongoing theta rhythm that dominates hippocampal local field potentials, place cells form precisely ordered sequences of activity. These neural sequences result from the integration of both external inputs conveying sensory-motor information, and intrinsic network dynamics possibly related to memory processes. Their endogenous replay during subsequent sleep is critical for memory consolidation. The present review discusses possible mechanisms and functions of hippocampal theta sequences during exploration. We present several lines of evidence suggesting that these neural sequences play a key role in information processing and support the formation of initial memory traces, and discuss potential functional distinctions between neural sequences emerging during theta vs. awake sharp-wave ripples.
Collapse
Affiliation(s)
- Céline Drieu
- Center for Interdisciplinary Research in Biology, Collège de France, CNRS UMR 7241, INSERM U 1050, PSL Research University, Paris, France
| | - Michaël Zugaro
- Center for Interdisciplinary Research in Biology, Collège de France, CNRS UMR 7241, INSERM U 1050, PSL Research University, Paris, France
| |
Collapse
|
7
|
Reboreda A, Theissen FM, Valero-Aracama MJ, Arboit A, Corbu MA, Yoshida M. Do TRPC channels support working memory? Comparing modulations of TRPC channels and working memory through G-protein coupled receptors and neuromodulators. Behav Brain Res 2018; 354:64-83. [PMID: 29501506 DOI: 10.1016/j.bbr.2018.02.042] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2017] [Revised: 02/27/2018] [Accepted: 02/27/2018] [Indexed: 12/11/2022]
Abstract
Working memory is a crucial ability we use in daily life. However, the cellular mechanisms supporting working memory still remain largely unclear. A key component of working memory is persistent neural firing which is believed to serve short-term (hundreds of milliseconds up to tens of seconds) maintenance of necessary information. In this review, we will focus on the role of transient receptor potential canonical (TRPC) channels as a mechanism underlying persistent firing. Many years of in vitro work have been suggesting a crucial role of TRPC channels in working memory and temporal association tasks. If TRPC channels are indeed a central mechanism for working memory, manipulations which impair or facilitate working memory should have a similar effect on TRPC channel modulation. However, modulations of working memory and TRPC channels were never systematically compared, and it remains unanswered whether TRPC channels indeed contribute to working memory in vivo or not. In this article, we review the effects of G-protein coupled receptors (GPCR) and neuromodulators, including acetylcholine, noradrenalin, serotonin and dopamine, on working memory and TRPC channels. Based on comparisons, we argue that GPCR and downstream signaling pathways that activate TRPC, generally support working memory, while those that suppress TRPC channels impair it. However, depending on the channel types, areas, and systems tested, this is not the case in all studies. Further work to clarify involvement of specific TRPC channels in working memory tasks and how they are affected by neuromodulators is still necessary in the future.
Collapse
Affiliation(s)
- Antonio Reboreda
- Leibniz Institute for Neurobiology (LIN) Magdeburg, Brenneckestraße 6, 39118 Magdeburg, Germany; German Center for Neurodegenerative Diseases (DZNE) Magdeburg, Leipziger Str. 44/Haus 64, 39120, Magdeburg, Germany.
| | - Frederik M Theissen
- German Center for Neurodegenerative Diseases (DZNE) Magdeburg, Leipziger Str. 44/Haus 64, 39120, Magdeburg, Germany
| | - Maria J Valero-Aracama
- Institute of Physiology and Pathophysiology, Friedrich-Alexander-Universität Erlangen-Nürnberg, Universitätsstraße 17, 91054 Erlangen, Germany
| | - Alberto Arboit
- German Center for Neurodegenerative Diseases (DZNE) Magdeburg, Leipziger Str. 44/Haus 64, 39120, Magdeburg, Germany
| | - Mihaela A Corbu
- Ruhr University Bochum (RUB), Universitätsstraße 150, 44801, Bochum, Germany
| | - Motoharu Yoshida
- Leibniz Institute for Neurobiology (LIN) Magdeburg, Brenneckestraße 6, 39118 Magdeburg, Germany; German Center for Neurodegenerative Diseases (DZNE) Magdeburg, Leipziger Str. 44/Haus 64, 39120, Magdeburg, Germany; Center for Behavioral Brain Sciences, 39106, Magdeburg, Germany.
| |
Collapse
|
8
|
Hasselmo ME, Hinman JR, Dannenberg H, Stern CE. Models of spatial and temporal dimensions of memory. Curr Opin Behav Sci 2017; 17:27-33. [PMID: 29130060 DOI: 10.1016/j.cobeha.2017.05.024] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
Episodic memory involves coding of the spatial location and time of individual events. Coding of space and time is also relevant to working memory, spatial navigation, and the disambiguation of overlapping memory representations. Neurophysiological data demonstrate that neuronal activity codes the current, past and future location of an animal as well as temporal intervals within a task. Models have addressed how neural coding of space and time for memory function could arise, with both dimensions coded by the same neurons. Neural coding could depend upon network oscillatory and attractor dynamics as well as modulation of neuronal intrinsic properties. These models are relevant to the coding of space and time involving structures including the hippocampus, entorhinal cortex, retrosplenial cortex, striatum and parahippocampal gyrus, which have been implicated in both animal and human studies.
Collapse
Affiliation(s)
- Michael E Hasselmo
- Center for Systems Neuroscience, Boston University, 610 Commonwealth Ave., Boston, MA 02215
| | - James R Hinman
- Center for Systems Neuroscience, Boston University, 610 Commonwealth Ave., Boston, MA 02215
| | - Holger Dannenberg
- Center for Systems Neuroscience, Boston University, 610 Commonwealth Ave., Boston, MA 02215
| | - Chantal E Stern
- Center for Systems Neuroscience, Boston University, 610 Commonwealth Ave., Boston, MA 02215
| |
Collapse
|
9
|
Prince LY, Bacon TJ, Tigaret CM, Mellor JR. Neuromodulation of the Feedforward Dentate Gyrus-CA3 Microcircuit. Front Synaptic Neurosci 2016; 8:32. [PMID: 27799909 PMCID: PMC5065980 DOI: 10.3389/fnsyn.2016.00032] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2016] [Accepted: 09/20/2016] [Indexed: 12/16/2022] Open
Abstract
The feedforward dentate gyrus-CA3 microcircuit in the hippocampus is thought to activate ensembles of CA3 pyramidal cells and interneurons to encode and retrieve episodic memories. The creation of these CA3 ensembles depends on neuromodulatory input and synaptic plasticity within this microcircuit. Here we review the mechanisms by which the neuromodulators aceylcholine, noradrenaline, dopamine, and serotonin reconfigure this microcircuit and thereby infer the net effect of these modulators on the processes of episodic memory encoding and retrieval.
Collapse
Affiliation(s)
- Luke Y Prince
- Centre for Synaptic Plasticity, School of Physiology, Pharmacology and Neuroscience, University of Bristol Bristol, UK
| | - Travis J Bacon
- Centre for Synaptic Plasticity, School of Physiology, Pharmacology and Neuroscience, University of Bristol Bristol, UK
| | - Cezar M Tigaret
- Centre for Synaptic Plasticity, School of Physiology, Pharmacology and Neuroscience, University of Bristol Bristol, UK
| | - Jack R Mellor
- Centre for Synaptic Plasticity, School of Physiology, Pharmacology and Neuroscience, University of Bristol Bristol, UK
| |
Collapse
|
10
|
Kitamura T, Macdonald CJ, Tonegawa S. Entorhinal-hippocampal neuronal circuits bridge temporally discontiguous events. ACTA ACUST UNITED AC 2015; 22:438-43. [PMID: 26286654 PMCID: PMC4561404 DOI: 10.1101/lm.038687.115] [Citation(s) in RCA: 63] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2015] [Accepted: 06/29/2015] [Indexed: 11/24/2022]
Abstract
The entorhinal cortex (EC)-hippocampal (HPC) network plays an essential role for episodic memory, which preserves spatial and temporal information about the occurrence of past events. Although there has been significant progress toward understanding the neural circuits underlying the spatial dimension of episodic memory, the relevant circuits subserving the temporal dimension are just beginning to be understood. In this review, we examine the evidence concerning the role of the EC in associating events separated by time--or temporal associative learning--with emphasis on the function of persistent activity in the medial entorhinal cortex layer III (MECIII) and their direct inputs into the CA1 region of HPC. We also discuss the unique role of Island cells in the medial entorhinal cortex layer II (MECII), which is a newly discovered direct feedforward inhibitory circuit to CA1. Finally, we relate the function of these entorhinal cortical circuits to recent findings concerning hippocampal time cells, which may collectively activate in sequence to bridge temporal gaps between discontiguous events in an episode.
Collapse
Affiliation(s)
- Takashi Kitamura
- RIKEN-MIT Center for Neural Circuit Genetics at the Picower Institute for Learning and Memory, Department of Biology and Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA
| | - Christopher J Macdonald
- RIKEN-MIT Center for Neural Circuit Genetics at the Picower Institute for Learning and Memory, Department of Biology and Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA
| | - Susumu Tonegawa
- RIKEN-MIT Center for Neural Circuit Genetics at the Picower Institute for Learning and Memory, Department of Biology and Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA Howard Hughes Medical Institute at MIT, Cambridge, Massachusetts 02139, USA
| |
Collapse
|