1
|
Maggi S, Hock RM, O'Neill M, Buckley M, Moran PM, Bast T, Sami M, Humphries MD. Tracking subjects' strategies in behavioural choice experiments at trial resolution. eLife 2024; 13:e86491. [PMID: 38426402 PMCID: PMC10959529 DOI: 10.7554/elife.86491] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2023] [Accepted: 02/23/2024] [Indexed: 03/02/2024] Open
Abstract
Investigating how, when, and what subjects learn during decision-making tasks requires tracking their choice strategies on a trial-by-trial basis. Here, we present a simple but effective probabilistic approach to tracking choice strategies at trial resolution using Bayesian evidence accumulation. We show this approach identifies both successful learning and the exploratory strategies used in decision tasks performed by humans, non-human primates, rats, and synthetic agents. Both when subjects learn and when rules change the exploratory strategies of win-stay and lose-shift, often considered complementary, are consistently used independently. Indeed, we find the use of lose-shift is strong evidence that subjects have latently learnt the salient features of a new rewarded rule. Our approach can be extended to any discrete choice strategy, and its low computational cost is ideally suited for real-time analysis and closed-loop control.
Collapse
Affiliation(s)
- Silvia Maggi
- School of Psychology, University of NottinghamNottinghamUnited Kingdom
| | - Rebecca M Hock
- School of Psychology, University of NottinghamNottinghamUnited Kingdom
| | - Martin O'Neill
- School of Psychology, University of NottinghamNottinghamUnited Kingdom
- Department of Health & Nutritional Sciences, Atlantic Technological UniversitySligoIreland
| | - Mark Buckley
- Department of Experimental Psychology, University of OxfordOxfordUnited Kingdom
| | - Paula M Moran
- School of Psychology, University of NottinghamNottinghamUnited Kingdom
- Department of Neuroscience, University of NottinghamNottinghamUnited Kingdom
| | - Tobias Bast
- School of Psychology, University of NottinghamNottinghamUnited Kingdom
- Department of Neuroscience, University of NottinghamNottinghamUnited Kingdom
| | - Musa Sami
- Institute of Mental Health, University of NottinghamNottinghamUnited Kingdom
| | - Mark D Humphries
- School of Psychology, University of NottinghamNottinghamUnited Kingdom
| |
Collapse
|
2
|
Sheynikhovich D, Otani S, Bai J, Arleo A. Long-term memory, synaptic plasticity and dopamine in rodent medial prefrontal cortex: Role in executive functions. Front Behav Neurosci 2023; 16:1068271. [PMID: 36710953 PMCID: PMC9875091 DOI: 10.3389/fnbeh.2022.1068271] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2022] [Accepted: 12/26/2022] [Indexed: 01/12/2023] Open
Abstract
Mnemonic functions, supporting rodent behavior in complex tasks, include both long-term and (short-term) working memory components. While working memory is thought to rely on persistent activity states in an active neural network, long-term memory and synaptic plasticity contribute to the formation of the underlying synaptic structure, determining the range of possible states. Whereas, the implication of working memory in executive functions, mediated by the prefrontal cortex (PFC) in primates and rodents, has been extensively studied, the contribution of long-term memory component to these tasks received little attention. This review summarizes available experimental data and theoretical work concerning cellular mechanisms of synaptic plasticity in the medial region of rodent PFC and the link between plasticity, memory and behavior in PFC-dependent tasks. A special attention is devoted to unique properties of dopaminergic modulation of prefrontal synaptic plasticity and its contribution to executive functions.
Collapse
Affiliation(s)
- Denis Sheynikhovich
- Sorbonne Université, INSERM, CNRS, Institut de la Vision, Paris, France,*Correspondence: Denis Sheynikhovich ✉
| | - Satoru Otani
- Sorbonne Université, INSERM, CNRS, Institut de la Vision, Paris, France
| | - Jing Bai
- Institute of Psychiatry and Neuroscience of Paris, INSERM U1266, Paris, France
| | - Angelo Arleo
- Sorbonne Université, INSERM, CNRS, Institut de la Vision, Paris, France
| |
Collapse
|
3
|
Yang Y, Kocher SD, Lewis MM, Mailman RB. Dose-Dependent Regulation on Prefrontal Neuronal Working Memory by Dopamine D1 Agonists: Evidence of Receptor Functional Selectivity-Related Mechanisms. Front Neurosci 2022; 16:898051. [PMID: 35784852 PMCID: PMC9244699 DOI: 10.3389/fnins.2022.898051] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2022] [Accepted: 05/31/2022] [Indexed: 11/13/2022] Open
Abstract
Low doses of dopamine D1 agonists improve working memory-related behavior, but high doses eliminate the improvement, thus yielding an ‘inverted-U’ dose-response curve. This dose-dependency also occurs at the single neuron level in the prefrontal cortex where the cellular basis of working memory is represented. Because signaling mechanisms are unclear, we examined this process at the neuron population level. Two D1 agonists (2-methyldihydrexidine and CY208,243) having different signaling bias were tested in rats performing a spatial working memory-related T-maze task. 2-Methyldihydrexidine is slightly bias toward D1-mediated β-arrestin-related signaling as it is a full agonist at adenylate cyclase and a super-agonist at β-arrestin recruitment, whereas CY208,243 is slightly bias toward D1-mediated cAMP signaling as it has relatively high intrinsic activity at adenylate cyclase, but is a partial agonist at β-arrestin recruitment. Both compounds had the expected inverted U dose-dependency in modulating prefrontal neuronal activities, albeit with important differences. Although CY208,243 was superior in improving the strength of neuronal outcome sensitivity to the working memory-related choice behavior in the T-maze, 2-methyldihydrexidine better reduced neuron-to-neuron variation. Interestingly, at the neuron population level, both drugs affected the percentage, uniformity, and ensemble strength of neuronal sensitivity in a complicated dose-dependent fashion, but the overall effect suggested higher efficiency and potency of 2-methyldihydrexidine compared to CY208,243. The differences between 2-methyldihydrexidine and CY208,243 may be related to their specific D1 signaling. These results suggest that D1-related dose-dependent regulation of working memory can be modified differentially by functionally selective ligands, theoretically increasing the balance between desired and undesired effects.
Collapse
Affiliation(s)
- Yang Yang
- Department of Pharmacology, Penn State Milton S. Hershey Medical Center, Penn State College of Medicine, Hershey, PA, United States
- Translational Brain Research Center, Penn State Milton S. Hershey Medical Center, Penn State College of Medicine, Hershey, PA, United States
- *Correspondence: Yang Yang,
| | - Susan D. Kocher
- Department of Pharmacology, Penn State Milton S. Hershey Medical Center, Penn State College of Medicine, Hershey, PA, United States
| | - Mechelle M. Lewis
- Department of Pharmacology, Penn State Milton S. Hershey Medical Center, Penn State College of Medicine, Hershey, PA, United States
- Translational Brain Research Center, Penn State Milton S. Hershey Medical Center, Penn State College of Medicine, Hershey, PA, United States
- Department of Neurology, Penn State Milton S. Hershey Medical Center, Penn State College of Medicine, Hershey, PA, United States
| | - Richard B. Mailman
- Department of Pharmacology, Penn State Milton S. Hershey Medical Center, Penn State College of Medicine, Hershey, PA, United States
- Translational Brain Research Center, Penn State Milton S. Hershey Medical Center, Penn State College of Medicine, Hershey, PA, United States
- Department of Neurology, Penn State Milton S. Hershey Medical Center, Penn State College of Medicine, Hershey, PA, United States
- Richard B. Mailman,
| |
Collapse
|
4
|
Maggi S, Humphries MD. Activity Subspaces in Medial Prefrontal Cortex Distinguish States of the World. J Neurosci 2022; 42:4131-4146. [PMID: 35422440 PMCID: PMC9121833 DOI: 10.1523/jneurosci.1412-21.2022] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2021] [Revised: 12/15/2021] [Accepted: 01/13/2022] [Indexed: 11/23/2022] Open
Abstract
Medial prefrontal cortex (mPfC) activity represents information about the state of the world, including present behavior, such as decisions, and the immediate past, such as short-term memory. Unknown is whether information about different states of the world are represented in the same mPfC neural population and, if so, how they are kept distinct. To address this, we analyze here mPfC population activity of male rats learning rules in a Y-maze, with self-initiated choice trials to an arm end followed by a self-paced return during the intertrial interval (ITI). We find that trial and ITI population activity from the same population fall into different low-dimensional subspaces. These subspaces encode different states of the world: multiple features of the task can be decoded from both trial and ITI activity, but the decoding axes for the same feature are roughly orthogonal between the two task phases, and the decodings are predominantly of features of the present during the trial but features of the preceding trial during the ITI. These subspace distinctions are carried forward into sleep, where population activity is preferentially reactivated in post-training sleep but differently for activity from the trial and ITI subspaces. Our results suggest that the problem of interference when representing different states of the world is solved in mPfC by population activity occupying different subspaces for the world states, which can be independently decoded by downstream targets and independently addressed by upstream inputs.SIGNIFICANCE STATEMENT Activity in the medial prefrontal cortex plays a role in representing the current and past states of the world. We show that during a maze task, the activity of a single population in medial prefrontal cortex represents at least two different states of the world. These representations were sequential and sufficiently distinct that a downstream population could separately read out either state from that activity. Moreover, the activity representing different states is differently reactivated in sleep. Different world states can thus be represented in the same medial prefrontal cortex population but in such a way that prevents potentially catastrophic interference between them.
Collapse
Affiliation(s)
- Silvia Maggi
- School of Psychology, University of Nottingham, Nottingham NG7 2RD, United Kingdom
| | - Mark D Humphries
- School of Psychology, University of Nottingham, Nottingham NG7 2RD, United Kingdom
| |
Collapse
|
5
|
Rule ME, O'Leary T. Self-healing codes: How stable neural populations can track continually reconfiguring neural representations. Proc Natl Acad Sci U S A 2022; 119:e2106692119. [PMID: 35145024 PMCID: PMC8851551 DOI: 10.1073/pnas.2106692119] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2021] [Accepted: 12/29/2021] [Indexed: 12/19/2022] Open
Abstract
As an adaptive system, the brain must retain a faithful representation of the world while continuously integrating new information. Recent experiments have measured population activity in cortical and hippocampal circuits over many days and found that patterns of neural activity associated with fixed behavioral variables and percepts change dramatically over time. Such "representational drift" raises the question of how malleable population codes can interact coherently with stable long-term representations that are found in other circuits and with relatively rigid topographic mappings of peripheral sensory and motor signals. We explore how known plasticity mechanisms can allow single neurons to reliably read out an evolving population code without external error feedback. We find that interactions between Hebbian learning and single-cell homeostasis can exploit redundancy in a distributed population code to compensate for gradual changes in tuning. Recurrent feedback of partially stabilized readouts could allow a pool of readout cells to further correct inconsistencies introduced by representational drift. This shows how relatively simple, known mechanisms can stabilize neural tuning in the short term and provides a plausible explanation for how plastic neural codes remain integrated with consolidated, long-term representations.
Collapse
Affiliation(s)
- Michael E Rule
- Engineering Department, University of Cambridge, Cambridge CB2 1PZ, United Kingdom
| | - Timothy O'Leary
- Engineering Department, University of Cambridge, Cambridge CB2 1PZ, United Kingdom
| |
Collapse
|
6
|
Hennig JA, Oby ER, Losey DM, Batista AP, Yu BM, Chase SM. How learning unfolds in the brain: toward an optimization view. Neuron 2021; 109:3720-3735. [PMID: 34648749 PMCID: PMC8639641 DOI: 10.1016/j.neuron.2021.09.005] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2021] [Revised: 08/25/2021] [Accepted: 09/02/2021] [Indexed: 12/17/2022]
Abstract
How do changes in the brain lead to learning? To answer this question, consider an artificial neural network (ANN), where learning proceeds by optimizing a given objective or cost function. This "optimization framework" may provide new insights into how the brain learns, as many idiosyncratic features of neural activity can be recapitulated by an ANN trained to perform the same task. Nevertheless, there are key features of how neural population activity changes throughout learning that cannot be readily explained in terms of optimization and are not typically features of ANNs. Here we detail three of these features: (1) the inflexibility of neural variability throughout learning, (2) the use of multiple learning processes even during simple tasks, and (3) the presence of large task-nonspecific activity changes. We propose that understanding the role of these features in the brain will be key to describing biological learning using an optimization framework.
Collapse
Affiliation(s)
- Jay A Hennig
- Neuroscience Institute, Carnegie Mellon University, Pittsburgh, PA, USA; Center for the Neural Basis of Cognition, Pittsburgh, PA, USA; Machine Learning Department, Carnegie Mellon University, Pittsburgh, PA, USA.
| | - Emily R Oby
- Center for the Neural Basis of Cognition, Pittsburgh, PA, USA; Department of Bioengineering, University of Pittsburgh, Pittsburgh, PA, USA
| | - Darby M Losey
- Neuroscience Institute, Carnegie Mellon University, Pittsburgh, PA, USA; Center for the Neural Basis of Cognition, Pittsburgh, PA, USA; Machine Learning Department, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Aaron P Batista
- Center for the Neural Basis of Cognition, Pittsburgh, PA, USA; Department of Bioengineering, University of Pittsburgh, Pittsburgh, PA, USA
| | - Byron M Yu
- Neuroscience Institute, Carnegie Mellon University, Pittsburgh, PA, USA; Center for the Neural Basis of Cognition, Pittsburgh, PA, USA; Department of Electrical and Computer Engineering, Carnegie Mellon University, Pittsburgh, PA, USA; Department of Biomedical Engineering, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Steven M Chase
- Neuroscience Institute, Carnegie Mellon University, Pittsburgh, PA, USA; Center for the Neural Basis of Cognition, Pittsburgh, PA, USA; Department of Biomedical Engineering, Carnegie Mellon University, Pittsburgh, PA, USA
| |
Collapse
|
7
|
Coordinated Prefrontal State Transition Leads Extinction of Reward-Seeking Behaviors. J Neurosci 2021; 41:2406-2419. [PMID: 33531416 DOI: 10.1523/jneurosci.2588-20.2021] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2020] [Revised: 12/16/2020] [Accepted: 01/17/2021] [Indexed: 11/21/2022] Open
Abstract
Extinction learning suppresses conditioned reward responses and is thus fundamental to adapt to changing environmental demands and to control excessive reward seeking. The medial prefrontal cortex (mPFC) monitors and controls conditioned reward responses. Abrupt transitions in mPFC activity anticipate changes in conditioned responses to altered contingencies. It remains, however, unknown whether such transitions are driven by the extinction of old behavioral strategies or by the acquisition of new competing ones. Using in vivo multiple single-unit recordings of mPFC in male rats, we studied the relationship between single-unit and population dynamics during extinction learning, using alcohol as a positive reinforcer in an operant conditioning paradigm. To examine the fine temporal relation between neural activity and behavior, we developed a novel behavioral model that allowed us to identify the number, onset, and duration of extinction-learning episodes in the behavior of each animal. We found that single-unit responses to conditioned stimuli changed even under stable experimental conditions and behavior. However, when behavioral responses to task contingencies had to be updated, unit-specific modulations became coordinated across the whole population, pushing the network into a new stable attractor state. Thus, extinction learning is not associated with suppressed mPFC responses to conditioned stimuli, but is anticipated by single-unit coordination into population-wide transitions of the internal state of the animal.SIGNIFICANCE STATEMENT The ability to suppress conditioned behaviors when no longer beneficial is fundamental for the survival of any organism. While pharmacological and optogenetic interventions have shown a critical involvement of the mPFC in the suppression of conditioned responses, the neural dynamics underlying such a process are still largely unknown. Combining novel analysis tools to describe behavior, single-neuron response, and population activity, we found that widespread changes in neuronal firing temporally coordinate across the whole mPFC population in anticipation of behavioral extinction. This coordination leads to a global transition in the internal state of the network, driving extinction of conditioned behavior.
Collapse
|
8
|
Rule ME, Loback AR, Raman DV, Driscoll LN, Harvey CD, O'Leary T. Stable task information from an unstable neural population. eLife 2020; 9:51121. [PMID: 32660692 PMCID: PMC7392606 DOI: 10.7554/elife.51121] [Citation(s) in RCA: 52] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2019] [Accepted: 06/17/2020] [Indexed: 02/06/2023] Open
Abstract
Over days and weeks, neural activity representing an animal's position and movement in sensorimotor cortex has been found to continually reconfigure or 'drift' during repeated trials of learned tasks, with no obvious change in behavior. This challenges classical theories, which assume stable engrams underlie stable behavior. However, it is not known whether this drift occurs systematically, allowing downstream circuits to extract consistent information. Analyzing long-term calcium imaging recordings from posterior parietal cortex in mice (Mus musculus), we show that drift is systematically constrained far above chance, facilitating a linear weighted readout of behavioral variables. However, a significant component of drift continually degrades a fixed readout, implying that drift is not confined to a null coding space. We calculate the amount of plasticity required to compensate drift independently of any learning rule, and find that this is within physiologically achievable bounds. We demonstrate that a simple, biologically plausible local learning rule can achieve these bounds, accurately decoding behavior over many days.
Collapse
Affiliation(s)
- Michael E Rule
- Department of Engineering, University of Cambridge, Cambridge, United Kingdom
| | - Adrianna R Loback
- Department of Engineering, University of Cambridge, Cambridge, United Kingdom
| | - Dhruva V Raman
- Department of Engineering, University of Cambridge, Cambridge, United Kingdom
| | - Laura N Driscoll
- Department of Electrical Engineering, Stanford University, Stanford, United States
| | | | - Timothy O'Leary
- Department of Engineering, University of Cambridge, Cambridge, United Kingdom
| |
Collapse
|
9
|
Woon EP, Sequeira MK, Barbee BR, Gourley SL. Involvement of the rodent prelimbic and medial orbitofrontal cortices in goal-directed action: A brief review. J Neurosci Res 2020; 98:1020-1030. [PMID: 31820488 PMCID: PMC7392403 DOI: 10.1002/jnr.24567] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2019] [Revised: 10/13/2019] [Accepted: 11/15/2019] [Indexed: 01/15/2023]
Abstract
Goal-directed action refers to selecting behaviors based on the expectation that they will be reinforced with desirable outcomes. It is typically conceptualized as opposing habit-based behaviors, which are instead supported by stimulus-response associations and insensitive to consequences. The prelimbic prefrontal cortex (PL) is positioned along the medial wall of the rodent prefrontal cortex. It is indispensable for action-outcome-driven (goal-directed) behavior, consolidating action-outcome relationships and linking contextual information with instrumental behavior. In this brief review, we will discuss the growing list of molecular factors involved in PL function. Ventral to the PL is the medial orbitofrontal cortex (mOFC). We will also summarize emerging evidence from rodents (complementing existing literature describing humans) that it too is involved in action-outcome conditioning. We describe experiments using procedures that quantify responding based on reward value, the likelihood of reinforcement, or effort requirements, touching also on experiments assessing food consumption more generally. We synthesize these findings with the argument that the mOFC is essential to goal-directed action when outcome value information is not immediately observable and must be recalled and inferred.
Collapse
Affiliation(s)
- Ellen P. Woon
- Graduate Program in Neuroscience
- Yerkes National Primate Research Center, Departments of Pediatrics and Psychiatry and Behavioral Sciences, Center for Translational and Social Neuroscience
| | - Michelle K. Sequeira
- Graduate Program in Neuroscience
- Yerkes National Primate Research Center, Departments of Pediatrics and Psychiatry and Behavioral Sciences, Center for Translational and Social Neuroscience
| | - Britton R. Barbee
- Yerkes National Primate Research Center, Departments of Pediatrics and Psychiatry and Behavioral Sciences, Center for Translational and Social Neuroscience
- Graduate Program in Molecular and Systems Pharmacology Emory University, Atlanta, GA
| | - Shannon L. Gourley
- Graduate Program in Neuroscience
- Yerkes National Primate Research Center, Departments of Pediatrics and Psychiatry and Behavioral Sciences, Center for Translational and Social Neuroscience
- Graduate Program in Molecular and Systems Pharmacology Emory University, Atlanta, GA
| |
Collapse
|