Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Kolling N, Akam T. (Reinforcement?) Learning to forage optimally. Curr Opin Neurobiol 2017;46:162-169. [PMID: 28918312 DOI: 10.1016/j.conb.2017.08.008] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2017] [Revised: 08/06/2017] [Accepted: 08/17/2017] [Indexed: 11/24/2022]

For:	Kolling N, Akam T. (Reinforcement?) Learning to forage optimally. Curr Opin Neurobiol 2017;46:162-169. [PMID: 28918312 DOI: 10.1016/j.conb.2017.08.008] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2017] [Revised: 08/06/2017] [Accepted: 08/17/2017] [Indexed: 11/24/2022]

Number

Cited by Other Article(s)

Hong I, Wolfe JM. Research on re-searching: interrupted foraging is not disrupted foraging. Cogn Res Princ Implic 2024;9:30. [PMID: 38748189 PMCID: PMC11096138 DOI: 10.1186/s41235-024-00556-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/01/2024] [Accepted: 04/26/2024] [Indexed: 05/18/2024] Open

Alejandro RJ, Holroyd CB. Hierarchical control over foraging behavior by anterior cingulate cortex. Neurosci Biobehav Rev 2024;160:105623. [PMID: 38490499 DOI: 10.1016/j.neubiorev.2024.105623] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2023] [Revised: 02/14/2024] [Accepted: 03/13/2024] [Indexed: 03/17/2024]

Lloyd A, Viding E, McKay R, Furl N. Understanding patch foraging strategies across development. Trends Cogn Sci 2023;27:1085-1098. [PMID: 37500422 DOI: 10.1016/j.tics.2023.07.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2023] [Revised: 07/05/2023] [Accepted: 07/06/2023] [Indexed: 07/29/2023]

Garcia M, Gupta S, Wikenheiser AM. Sex differences in patch-leaving foraging decisions in rats. OXFORD OPEN NEUROSCIENCE 2023;2:kvad011. [PMID: 38596244 PMCID: PMC11003400 DOI: 10.1093/oons/kvad011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/16/2023] [Revised: 10/11/2023] [Accepted: 10/12/2023] [Indexed: 04/11/2024]

Garcia M, Gupta S, Wikenheiser AM. Sex differences in patch-leaving foraging decisions in rats. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.02.19.529135. [PMID: 36824852 PMCID: PMC9949151 DOI: 10.1101/2023.02.19.529135] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/21/2023]

Qadri MAJ, Cook RG. Learning and organization of within-session sequences by pigeons (Columba livia). Anim Cogn 2023;26:1571-1587. [PMID: 37335435 DOI: 10.1007/s10071-023-01801-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2023] [Revised: 05/31/2023] [Accepted: 06/06/2023] [Indexed: 06/21/2023]

Lin HY, von Helversen B. Never Gonna Give You Up Even When It Is Suboptimal. Cogn Sci 2023;47:e13323. [PMID: 37486808 DOI: 10.1111/cogs.13323] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2022] [Revised: 05/26/2023] [Accepted: 06/30/2023] [Indexed: 07/26/2023]

Harhen NC, Bornstein AM. Overharvesting in human patch foraging reflects rational structure learning and adaptive planning. Proc Natl Acad Sci U S A 2023;120:e2216524120. [PMID: 36961923 PMCID: PMC10068834 DOI: 10.1073/pnas.2216524120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2022] [Accepted: 02/11/2023] [Indexed: 03/26/2023] Open

Young ME, Howatt BC. Resource limitations: A taxonomy. Behav Processes 2023;206:104823. [PMID: 36682436 DOI: 10.1016/j.beproc.2023.104823] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2022] [Revised: 01/02/2023] [Accepted: 01/17/2023] [Indexed: 01/21/2023]

Whelan MT, Jimenez-Rodriguez A, Prescott TJ, Vasilaki E. A robotic model of hippocampal reverse replay for reinforcement learning. BIOINSPIRATION & BIOMIMETICS 2022;18:015007. [PMID: 36327454 DOI: 10.1088/1748-3190/ac9ffc] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/17/2021] [Accepted: 11/03/2022] [Indexed: 06/16/2023]

Davis GH, Crofoot MC, Farine DR. Using optimal foraging theory to infer how groups make collective decisions. Trends Ecol Evol 2022;37:942-952. [PMID: 35842325 DOI: 10.1016/j.tree.2022.06.010] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2021] [Revised: 05/17/2022] [Accepted: 06/20/2022] [Indexed: 12/23/2022]

Scholl J, Trier HA, Rushworth MFS, Kolling N. The effect of apathy and compulsivity on planning and stopping in sequential decision-making. PLoS Biol 2022;20:e3001566. [PMID: 35358177 PMCID: PMC8970514 DOI: 10.1371/journal.pbio.3001566] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2019] [Accepted: 02/03/2022] [Indexed: 11/21/2022] Open

Multi-step planning in the brain. Curr Opin Behav Sci 2021. [DOI: 10.1016/j.cobeha.2020.07.003] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]

Exploration: from machines to humans. Curr Opin Behav Sci 2020. [DOI: 10.1016/j.cobeha.2020.08.004] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]

Emberly E, Seamans JK. Abrupt, Asynchronous Changes in Action Representations by Anterior Cingulate Cortex Neurons during Trial and Error Learning. Cereb Cortex 2020;30:4336-4345. [PMID: 32239139 DOI: 10.1093/cercor/bhaa019] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2019] [Revised: 01/09/2020] [Accepted: 01/12/2020] [Indexed: 11/13/2022] Open

Gabay AS, Apps MAJ. Foraging optimally in social neuroscience: computations and methodological considerations. Soc Cogn Affect Neurosci 2020;16:782-794. [PMID: 32232360 PMCID: PMC8343566 DOI: 10.1093/scan/nsaa037] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2019] [Revised: 01/29/2020] [Accepted: 03/25/2020] [Indexed: 12/18/2022] Open

Foraging for foundations in decision neuroscience: insights from ethology. Nat Rev Neurosci 2019;19:419-427. [PMID: 29752468 DOI: 10.1038/s41583-018-0010-7] [Citation(s) in RCA: 93] [Impact Index Per Article: 18.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]

Davidson JD, El Hady A. Foraging as an evidence accumulation process. PLoS Comput Biol 2019;15:e1007060. [PMID: 31339878 PMCID: PMC6682163 DOI: 10.1371/journal.pcbi.1007060] [Citation(s) in RCA: 32] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2018] [Revised: 08/05/2019] [Accepted: 04/30/2019] [Indexed: 11/21/2022] Open

Abstract

The patch-leaving problem is a canonical foraging task, in which a forager must decide to leave a current resource in search for another. Theoretical work has derived optimal strategies for when to leave a patch, and experiments have tested for conditions where animals do or do not follow an optimal strategy. Nevertheless, models of patch-leaving decisions do not consider the imperfect and noisy sampling process through which an animal gathers information, and how this process is constrained by neurobiological mechanisms. In this theoretical study, we formulate an evidence accumulation model of patch-leaving decisions where the animal averages over noisy measurements to estimate the state of the current patch and the overall environment. We solve the model for conditions where foraging decisions are optimal and equivalent to the marginal value theorem, and perform simulations to analyze deviations from optimal when these conditions are not met. By adjusting the drift rate and decision threshold, the model can represent different “strategies”, for example an incremental, decremental, or counting strategy. These strategies yield identical decisions in the limiting case but differ in how patch residence times adapt when the foraging environment is uncertain. To describe sub-optimal decisions, we introduce an energy-dependent marginal utility function that predicts longer than optimal patch residence times when food is plentiful. Our model provides a quantitative connection between ecological models of foraging behavior and evidence accumulation models of decision making. Moreover, it provides a theoretical framework for potential experiments which seek to identify neural circuits underlying patch-leaving decisions.

Foraging is a ubiquitous animal behavior, performed by organisms as different as worms, birds, rats, and humans. Although the behavior has been extensively studied, it is not known how the brain processes information obtained during foraging activity to make subsequent foraging decisions. We form an evidence accumulation model of foraging decisions that describes the process through which an animal gathers information and uses it to make foraging decisions. By building on studies of the neural decision mechanisms within systems neuroscience, this model connects the foraging decision process with ecological models of patch-leaving decisions, such as the marginal value theorem. The model suggests the existence of different foraging strategies, which optimize for different environmental conditions and their potential implementation by neural decision making circuits. The model also shows how state-dependence, such as satiation level, can affect evidence accumulation to lead to sub-optimal foraging decisions. Our model provides a framework for future experimental studies which seek to elucidate how neural decision making mechanisms have been shaped by evolutionary forces in an animal’s surrounding environment.

Collapse

Foraging decisions as multi-armed bandit problems: Applying reinforcement learning algorithms to foraging data. J Theor Biol 2019;467:48-56. [PMID: 30735736 DOI: 10.1016/j.jtbi.2019.02.002] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2018] [Revised: 02/01/2019] [Accepted: 02/05/2019] [Indexed: 12/16/2022]

Maya C, Rosetti MF, Pacheco-Cobos L, Hudson R. Human Foragers: Searchers by Nature and Experience. EVOLUTIONARY PSYCHOLOGY 2019;17:1474704919839729. [PMID: 31010326 PMCID: PMC10358407 DOI: 10.1177/1474704919839729] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2018] [Accepted: 02/12/2019] [Indexed: 11/16/2022] Open

Hall-McMaster S, Luyckx F. Revisiting foraging approaches in neuroscience. COGNITIVE, AFFECTIVE & BEHAVIORAL NEUROSCIENCE 2019;19:225-230. [PMID: 30607832 PMCID: PMC6420423 DOI: 10.3758/s13415-018-00682-z] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]

Ramakrishnan A, Hayden BY, Platt ML. Local field potentials in dorsal anterior cingulate sulcus reflect rewards but not travel time costs during foraging. Brain Neurosci Adv 2019;3:2398212818817932. [PMID: 32166176 PMCID: PMC7058217 DOI: 10.1177/2398212818817932] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2018] [Accepted: 11/12/2018] [Indexed: 11/16/2022] Open

Schulz E, Wu CM, Huys QJM, Krause A, Speekenbrink M. Generalization and Search in Risky Environments. Cogn Sci 2018;42:2592-2620. [DOI: 10.1111/cogs.12695] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2017] [Revised: 09/26/2018] [Accepted: 09/26/2018] [Indexed: 12/01/2022]

Kolling N, Scholl J, Chekroud A, Trier HA, Rushworth MFS. Prospection, Perseverance, and Insight in Sequential Behavior. Neuron 2018;99:1069-1082.e7. [PMID: 30189202 PMCID: PMC6127030 DOI: 10.1016/j.neuron.2018.08.018] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2018] [Revised: 06/14/2018] [Accepted: 08/16/2018] [Indexed: 12/29/2022]

Kolling N, O'Reilly JX. State-change decisions and dorsomedial prefrontal cortex: the importance of time. Curr Opin Behav Sci 2018;22:152-160. [PMID: 30123818 PMCID: PMC6095941 DOI: 10.1016/j.cobeha.2018.06.017] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]

Sweis BM, Thomas MJ, Redish AD. Mice learn to avoid regret. PLoS Biol 2018;16:e2005853. [PMID: 29927938 PMCID: PMC6013153 DOI: 10.1371/journal.pbio.2005853] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2018] [Accepted: 05/14/2018] [Indexed: 11/19/2022] Open

Abstract

Regret can be defined as the subjective experience of recognizing that one has made a mistake and that a better alternative could have been selected. The experience of regret is thought to carry negative utility. This typically takes two distinct forms: augmenting immediate postregret valuations to make up for losses, and augmenting long-term changes in decision-making strategies to avoid future instances of regret altogether. While the short-term changes in valuation have been studied in human psychology, economics, neuroscience, and even recently in nonhuman-primate and rodent neurophysiology, the latter long-term process has received far less attention, with no reports of regret avoidance in nonhuman decision-making paradigms. We trained 31 mice in a novel variant of the Restaurant Row economic decision-making task, in which mice make decisions of whether to spend time from a limited budget to achieve food rewards of varying costs (delays). Importantly, we tested mice longitudinally for 70 consecutive days, during which the task provided their only source of food. Thus, decision strategies were interdependent across both trials and days. We separated principal commitment decisions from secondary reevaluation decisions across space and time and found evidence for regret-like behaviors following change-of-mind decisions that corrected prior economically disadvantageous choices. Immediately following change-of-mind events, subsequent decisions appeared to make up for lost effort by altering willingness to wait, decision speed, and pellet consumption speed, consistent with past reports of regret in rodents. As mice were exposed to an increasingly reward-scarce environment, we found they adapted and refined distinct economic decision-making strategies over the course of weeks to maximize reinforcement rate. However, we also found that even without changes in reinforcement rate, mice transitioned from an early strategy rooted in foraging to a strategy rooted in deliberation and planning that prevented future regret-inducing change-of-mind episodes from occurring. These data suggest that mice are learning to avoid future regret, independent of and separate from reinforcement rate maximization.

Regret describes a unique postdecision phenomenon in which losses are realized as a fault of one’s own actions. Regret is often hypothesized to have an inherent negative utility, and humans will often incur costs so as to avoid the risk of future regret. However, current models of nonhuman decision-making are based on reward maximization hypotheses. We recently found that rats express regret behaviorally and neurophysiologically on neuroeconomic foraging tasks; however, it remains unknown whether nonhuman animals will change strategies so as to avoid regret, even in the absence of changes in the achieved rate of reinforcement. Here, we provide the first evidence that mice change strategies to avoid future regret, independent of and separate from reinforcement rate maximization. Our data suggest mice accomplish this by shifting from a foraging decision-making strategy that produces change-of-mind decisions after investment mistakes to one rooted in deliberation that learns to plan ahead.

Collapse

Altering gain of the infralimbic-to-accumbens shell circuit alters economically dissociable decision-making algorithms. Proc Natl Acad Sci U S A 2018;115:E6347-E6355. [PMID: 29915034 DOI: 10.1073/pnas.1803084115] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open

Abstract

The nucleus accumbens shell (NAcSh) is involved in reward valuation. Excitatory projections from infralimbic cortex (IL) to NAcSh undergo synaptic remodeling in rodent models of addiction and enable the extinction of disadvantageous behaviors. However, how the strength of synaptic transmission of the IL-NAcSh circuit affects decision-making information processing and reward valuation remains unknown, particularly because these processes can conflict within a given trial and particularly given recent data suggesting that decisions arise from separable information-processing algorithms. The approach of many neuromodulation studies is to disrupt information flow during on-going behaviors; however, this limits the interpretation of endogenous encoding of computational processes. Furthermore, many studies are limited by the use of simple behavioral tests of value which are unable to dissociate neurally distinct decision-making algorithms. We optogenetically altered the strength of synaptic transmission between glutamatergic IL-NAcSh projections in mice trained on a neuroeconomic task capable of separating multiple valuation processes. We found that induction of long-term depression in these synapses produced lasting changes in foraging processes without disrupting deliberative processes. Mice displayed inflated reevaluations to stay when deciding whether to abandon continued reward-seeking investments but displayed no changes during initial commitment decisions. We also developed an ensemble-level measure of circuit-specific plasticity that revealed individual differences in foraging valuation tendencies. Our results demonstrate that alterations in projection-specific synaptic strength between the IL and the NAcSh are capable of augmenting self-control economic valuations within a particular decision-making modality and suggest that the valuation mechanisms for these multiple decision-making modalities arise from different circuits.

Collapse