1
|
Hallquist MN, Hwang K, Luna B, Dombrovski AY. Reward-based option competition in human dorsal stream and transition from stochastic exploration to exploitation in continuous space. SCIENCE ADVANCES 2024; 10:eadj2219. [PMID: 38394198 PMCID: PMC10889364 DOI: 10.1126/sciadv.adj2219] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/29/2023] [Accepted: 01/23/2024] [Indexed: 02/25/2024]
Abstract
Primates exploring and exploiting a continuous sensorimotor space rely on dynamic maps in the dorsal stream. Two complementary perspectives exist on how these maps encode rewards. Reinforcement learning models integrate rewards incrementally over time, efficiently resolving the exploration/exploitation dilemma. Working memory buffer models explain rapid plasticity of parietal maps but lack a plausible exploration/exploitation policy. The reinforcement learning model presented here unifies both accounts, enabling rapid, information-compressing map updates and efficient transition from exploration to exploitation. As predicted by our model, activity in human frontoparietal dorsal stream regions, but not in MT+, tracks the number of competing options, as preferred options are selectively maintained on the map, while spatiotemporally distant alternatives are compressed out. When valuable new options are uncovered, posterior β1/α oscillations desynchronize within 0.4 to 0.7 s, consistent with option encoding by competing β1-stabilized subpopulations. Together, outcomes matching locally cached reward representations rapidly update parietal maps, biasing choices toward often-sampled, rewarded options.
Collapse
Affiliation(s)
| | - Kai Hwang
- Department of Psychological and Brain Sciences, Iowa Neuroscience Institute, University of Iowa, Iowa City, IA, USA
| | - Beatriz Luna
- Department of Psychiatry, University of Pittsburgh, Pittsburgh, PA, USA
| | | |
Collapse
|
2
|
Nie H, Yu T, Zou Y, Li Y, Chen J, Xia J, Luo Q, Peng H. Effects of childhood maltreatment and major depressive disorder on functional connectivity in hippocampal subregions. Brain Imaging Behav 2024:10.1007/s11682-024-00859-w. [PMID: 38324083 DOI: 10.1007/s11682-024-00859-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/17/2024] [Indexed: 02/08/2024]
Abstract
Major Depressive Disorder (MDD) with childhood maltreatment is a prevalent clinical phenotype. Prior studies have observed abnormal hippocampal activity in MDD patients, considering the hippocampus as a single nucleus. However, there is limited research investigating the static and dynamic changes in hippocampal subregion functional connectivity (FC) in MDD patients with childhood maltreatment. Therefore, we employed static and dynamic FC analyses using hippocampal subregions, including the anterior hippocampus and posterior hippocampus, as seed regions to investigate the neurobiological alterations associated with MDD resulting from childhood maltreatment. This study involved four groups: MDD with (n = 48) and without childhood maltreatment (n = 30), as well as healthy controls with (n = 57) and without (n = 46) childhood maltreatment. Compared to MDD patients without childhood maltreatment, those with childhood maltreatment exhibit altered FC between the hippocampal subregion and multiple brain regions, including the anterior cingulate gyrus, superior frontal gyrus, putamen, calcarine gyrus, superior temporal gyrus, angular gyrus, and supplementary motor area. Additionally, dynamic FC between the right medial-2 hippocampal head and the right calcarine gyrus shows a positive correlation with childhood maltreatment across all its subtypes. Moreover, dFC between the right hippocampal tail and the left angular gyrus moderates the relationship between childhood maltreatment and the depression severity. Our findings of distinct FC patterns within hippocampal subregions provide new clues for understanding the neurobiological basis of MDD with childhood maltreatment.
Collapse
Affiliation(s)
- Huiqin Nie
- Department of Clinical Psychology, The Affiliated Brain Hospital of Guangzhou Medical University, Guangzhou, 510370, China
| | - Tong Yu
- Department of Clinical Psychology, The Affiliated Brain Hospital of Guangzhou Medical University, Guangzhou, 510370, China
| | - Yurong Zou
- Department of Clinical Psychology, The Affiliated Brain Hospital of Guangzhou Medical University, Guangzhou, 510370, China
| | - Yuhong Li
- Department of Publicity and Health Education, Shenzhen Longhua District Central Hospital, Shenzhen, 518000, China
| | - Juran Chen
- The Zhongshan Torch Hi-tech Industrial Development Zone Community Health Service, Zhongshan, 528437, China
| | - Jinrou Xia
- Department of Clinical Psychology, The Affiliated Brain Hospital of Guangzhou Medical University, Guangzhou, 510370, China
| | - Qianyi Luo
- Department of Clinical Psychology, The Affiliated Brain Hospital of Guangzhou Medical University, Guangzhou, 510370, China.
- Guangdong Engineering Technology Research Center for Translational Medicine of Mental Disorders, Guangzhou, 510370, China.
| | - Hongjun Peng
- Department of Clinical Psychology, The Affiliated Brain Hospital of Guangzhou Medical University, Guangzhou, 510370, China.
- Guangdong Engineering Technology Research Center for Translational Medicine of Mental Disorders, Guangzhou, 510370, China.
| |
Collapse
|
3
|
Wyatt LE, Hewan PA, Hogeveen J, Spreng RN, Turner GR. Exploration versus exploitation decisions in the human brain: A systematic review of functional neuroimaging and neuropsychological studies. Neuropsychologia 2024; 192:108740. [PMID: 38036246 DOI: 10.1016/j.neuropsychologia.2023.108740] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2023] [Revised: 10/15/2023] [Accepted: 11/21/2023] [Indexed: 12/02/2023]
Abstract
Thoughts and actions are often driven by a decision to either explore new avenues with unknown outcomes, or to exploit known options with predictable outcomes. Yet, the neural mechanisms underlying this exploration-exploitation trade-off in humans remain poorly understood. This is attributable to variability in the operationalization of exploration and exploitation as psychological constructs, as well as the heterogeneity of experimental protocols and paradigms used to study these choice behaviours. To address this gap, here we present a comprehensive review of the literature to investigate the neural basis of explore-exploit decision-making in humans. We first conducted a systematic review of functional magnetic resonance imaging (fMRI) studies of exploration-versus exploitation-based decision-making in healthy adult humans during foraging, reinforcement learning, and information search. Eleven fMRI studies met inclusion criterion for this review. Adopting a network neuroscience framework, synthesis of the findings across these studies revealed that exploration-based choice was associated with the engagement of attentional, control, and salience networks. In contrast, exploitation-based choice was associated with engagement of default network brain regions. We interpret these results in the context of a network architecture that supports the flexible switching between externally and internally directed cognitive processes, necessary for adaptive, goal-directed behaviour. To further investigate potential neural mechanisms underlying the exploration-exploitation trade-off we next surveyed studies involving neurodevelopmental, neuropsychological, and neuropsychiatric disorders, as well as lifespan development, and neurodegenerative diseases. We observed striking differences in patterns of explore-exploit decision-making across these populations, again suggesting that these two decision-making modes are supported by independent neural circuits. Taken together, our review highlights the need for precision-mapping of the neural circuitry and behavioural correlates associated with exploration and exploitation in humans. Characterizing exploration versus exploitation decision-making biases may offer a novel, trans-diagnostic approach to assessment, surveillance, and intervention for cognitive decline and dysfunction in normal development and clinical populations.
Collapse
Affiliation(s)
- Lindsay E Wyatt
- Department of Psychology, York University, Toronto, ON, Canada
| | - Patrick A Hewan
- Department of Psychology, York University, Toronto, ON, Canada
| | - Jeremy Hogeveen
- Department of Psychology, The University of New Mexico, Albuquerque, NM, USA
| | - R Nathan Spreng
- Montréal Neurological Institute, Department of Neurology and Neurosurgery, McGill University, Montréal, QC, H3A 2B4, Canada; Department of Psychology, McGill University, Montréal, QC, Canada; Department of Psychiatry, McGill University, Montréal, QC, Canada; McConnell Brain Imaging Centre, Montréal Neurological Institute, McGill University, Montréal, QC, Canada.
| | - Gary R Turner
- Department of Psychology, York University, Toronto, ON, Canada.
| |
Collapse
|
4
|
Speers LJ, Bilkey DK. Maladaptive explore/exploit trade-offs in schizophrenia. Trends Neurosci 2023; 46:341-354. [PMID: 36878821 DOI: 10.1016/j.tins.2023.02.001] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2022] [Revised: 01/30/2023] [Accepted: 02/08/2023] [Indexed: 03/07/2023]
Abstract
Schizophrenia is a complex disorder that remains poorly understood, particularly at the systems level. In this opinion article we argue that the explore/exploit trade-off concept provides a holistic and ecologically valid framework to resolve some of the apparent paradoxes that have emerged within schizophrenia research. We review recent evidence suggesting that fundamental explore/exploit behaviors may be maladaptive in schizophrenia during physical, visual, and cognitive foraging. We also describe how theories from the broader optimal foraging literature, such as the marginal value theorem (MVT), could provide valuable insight into how aberrant processing of reward, context, and cost/effort evaluations interact to produce maladaptive responses.
Collapse
Affiliation(s)
- Lucinda J Speers
- Department of Psychology, University of Otago, Dunedin 9016, New Zealand
| | - David K Bilkey
- Department of Psychology, University of Otago, Dunedin 9016, New Zealand.
| |
Collapse
|
5
|
Spreng RN, Turner GR. From exploration to exploitation: a shifting mental mode in late life development. Trends Cogn Sci 2021; 25:1058-1071. [PMID: 34593321 PMCID: PMC8844884 DOI: 10.1016/j.tics.2021.09.001] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2021] [Revised: 08/30/2021] [Accepted: 09/01/2021] [Indexed: 12/31/2022]
Abstract
Changes in cognition, affect, and brain function combine to promote a shift in the nature of mentation in older adulthood, favoring exploitation of prior knowledge over exploratory search as the starting point for thought and action. Age-related exploitation biases result from the accumulation of prior knowledge, reduced cognitive control, and a shift toward affective goals. These are accompanied by changes in cortical networks, as well as attention and reward circuits. By incorporating these factors into a unified account, the exploration-to-exploitation shift offers an integrative model of cognitive, affective, and brain aging. Here, we review evidence for this model, identify determinants and consequences, and survey the challenges and opportunities posed by an exploitation-biased mental mode in later life.
Collapse
Affiliation(s)
- R Nathan Spreng
- Laboratory of Brain and Cognition, Montreal Neurological Institute, Department of Neurology and Neurosurgery, McGill University, Montreal, QC H3A 2B4, Canada; McConnell Brain Imaging Centre, Montreal Neurological Institute, McGill University, Montreal, QC H3A 2B4, Canada; Departments of Psychiatry and Psychology, McGill University, Montreal, QC H3A 0G4, Canada.
| | - Gary R Turner
- Department of Psychology, York University, Toronto, ON M3J 1P3, Canada
| |
Collapse
|
6
|
Castilla A, Borst G, Cohen D, Fradin J, Lefrançois C, Houdé O, Zaoui M, Berthoz A. A New Paradigm for the Study of Cognitive Flexibility in Children and Adolescents: The "Virtual House Locomotor Maze" (VHLM). Front Psychiatry 2021; 12:708378. [PMID: 34630176 PMCID: PMC8495412 DOI: 10.3389/fpsyt.2021.708378] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/11/2021] [Accepted: 08/26/2021] [Indexed: 01/10/2023] Open
Abstract
Classical neuropsychological assessments are designed to explore cognitive brain functions using paper-and-pencil or digital tests. The purpose of this study was to design and to test a new protocol named the "Virtual House Locomotor Maze" (VHLM) for studying inhibitory control as well as mental flexibility using a visuo-spatial locomotor memory test. The VHLM is a simple maze including six houses using the technology of the Virtual Carpet Paradigm™. Ten typical development children (TD) were enrolled in this study. The participants were instructed to reach a target house as quickly as possible and to bear in mind the experimental instructions. We examined their planning and replanning abilities to take the shortest path to reach a target house. In order to study the cognitive processes during navigation, we implemented a spatio-temporal index based on the measure of kinematics behaviors (i.e., trajectories, tangential velocity and head direction). Replanning was tested by first repeating a path chosen by the subject to reach a given house. After learning this path, it was blocked imposing that the subject inhibited the learned trajectory and designed a new trajectory to reach the same house. We measured the latency of the departure after the presentation of each house and the initial direction of the trajectory. The results suggest that several strategies are used by the subjects for replanning and our measures could be used as an index of impulsivity.
Collapse
Affiliation(s)
- Alexander Castilla
- Université de Paris, LaPsyDÉ, CNRS, Paris, France
- Laboratoire de Psychologie et de Neurosciences, Institut de Médecine Environnementale (IME), Paris, France
- Centre Interdisciplinaire de recherche en Biologie (CIRB), Collège de France, Paris, France
| | - Gregoire Borst
- Université de Paris, LaPsyDÉ, CNRS, Paris, France
- Institut Universitaire de France (IUF), Paris, France
| | - David Cohen
- Département de Psychiatrie de l'Enfant et de l'Adolescent, AP-HP, Hôpital Pitié-Salpêtrière, and Institut des Systèmes Intelligents et de Robotiques, Sorbonne Université, Paris, France
| | - Jacques Fradin
- Laboratoire de Psychologie et de Neurosciences, Institut de Médecine Environnementale (IME), Paris, France
| | - Camille Lefrançois
- Laboratoire de Psychologie et de Neurosciences, Institut de Médecine Environnementale (IME), Paris, France
| | - Olivier Houdé
- Université de Paris, LaPsyDÉ, CNRS, Paris, France
- Institut Universitaire de France (IUF), Paris, France
| | - Mohamed Zaoui
- Centre Interdisciplinaire de recherche en Biologie (CIRB), Collège de France, Paris, France
| | - Alain Berthoz
- Centre Interdisciplinaire de recherche en Biologie (CIRB), Collège de France, Paris, France
| |
Collapse
|
7
|
Dombrovski AY, Hallquist MN. Search for solutions, learning, simulation, and choice processes in suicidal behavior. WILEY INTERDISCIPLINARY REVIEWS. COGNITIVE SCIENCE 2021; 13:e1561. [PMID: 34008338 PMCID: PMC9285563 DOI: 10.1002/wcs.1561] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/12/2020] [Revised: 03/06/2021] [Accepted: 04/07/2021] [Indexed: 12/25/2022]
Abstract
Suicide may be viewed as an unfortunate outcome of failures in decision processes. Such failures occur when the demands of a crisis exceed a person's capacity to (i) search for options, (ii) learn and simulate possible futures, and (iii) make advantageous value‐based choices. Can individual‐level decision deficits and biases drive the progression of the suicidal crisis? Our overview of the evidence on this question is informed by clinical theory and grounded in reinforcement learning and behavioral economics. Cohort and case–control studies provide strong evidence that limited cognitive capacity and particularly impaired cognitive control are associated with suicidal behavior, imposing cognitive constraints on decision‐making. We conceptualize suicidal ideation as an element of impoverished consideration sets resulting from a search for solutions under cognitive constraints and mood‐congruent Pavlovian influences, a view supported by mostly indirect evidence. More compelling is the evidence of impaired learning in people with a history of suicidal behavior. We speculate that an inability to simulate alternative futures using one's model of the world may undermine alternative solutions in a suicidal crisis. The hypothesis supported by the strongest evidence is that the selection of suicide over alternatives is facilitated by a choice process undermined by randomness. Case–control studies using gambling tasks, armed bandits, and delay discounting support this claim. Future experimental studies will need to uncover real‐time dynamics of choice processes in suicidal people. In summary, the decision process framework sheds light on neurocognitive mechanisms that facilitate the progression of the suicidal crisis. This article is categorized under:Economics > Individual Decision‐Making Psychology > Emotion and Motivation Psychology > Learning Neuroscience > Behavior
Collapse
Affiliation(s)
| | - Michael N Hallquist
- Department of Psychology and Neuroscience, University of North Carolina, Chapel Hill, North Carolina, USA
| |
Collapse
|