• Reference Citation Analysis
  • v
  • v
  • Find an Article
Find an Article PDF (4627316)   Today's Articles (225)   Subscriber (49582)
For: Geist M, Pietquin O. Kalman Temporal Differences. J ARTIF INTELL RES 2010. [DOI: 10.1613/jair.3077] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]  Open
Number Cited by Other Article(s)
1
Kang P, Tobler PN, Dayan P. Bayesian reinforcement learning: A basic overview. Neurobiol Learn Mem 2024;211:107924. [PMID: 38579896 DOI: 10.1016/j.nlm.2024.107924] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2023] [Revised: 03/21/2024] [Accepted: 04/02/2024] [Indexed: 04/07/2024]
2
Salimibeni M, Mohammadi A, Malekzadeh P, Plataniotis KN. Multi-Agent Reinforcement Learning via Adaptive Kalman Temporal Difference and Successor Representation. SENSORS (BASEL, SWITZERLAND) 2022;22:1393. [PMID: 35214293 PMCID: PMC8962978 DOI: 10.3390/s22041393] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/30/2021] [Revised: 02/04/2022] [Accepted: 02/07/2022] [Indexed: 06/14/2023]
3
AKF-SR: Adaptive Kalman filtering-based successor representation. Neurocomputing 2022. [DOI: 10.1016/j.neucom.2021.10.008] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
4
Song T, Li D, Yang W, Hirasawa K. Recursive Least-Squares Temporal Difference With Gradient Correction. IEEE TRANSACTIONS ON CYBERNETICS 2021;51:4251-4264. [PMID: 30908269 DOI: 10.1109/tcyb.2019.2902342] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
5
Tnunay H, Li Z, Ding Z. Distributed nonlinear Kalman filter with communication protocol. Inf Sci (N Y) 2020. [DOI: 10.1016/j.ins.2019.10.053] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
6
Khamassi M, Velentzas G, Tsitsimis T, Tzafestas C. Robot Fast Adaptation to Changes in Human Engagement During Simulated Dynamic Social Interaction With Active Exploration in Parameterized Reinforcement Learning. IEEE Trans Cogn Dev Syst 2018. [DOI: 10.1109/tcds.2018.2843122] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
7
Gershman SJ. Dopamine, Inference, and Uncertainty. Neural Comput 2017;29:3311-3326. [DOI: 10.1162/neco_a_01023] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
8
Computational models as statistical tools. Curr Opin Behav Sci 2016. [DOI: 10.1016/j.cobeha.2016.07.004] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
9
Gershman SJ. A Unifying Probabilistic View of Associative Learning. PLoS Comput Biol 2015;11:e1004567. [PMID: 26535896 PMCID: PMC4633133 DOI: 10.1371/journal.pcbi.1004567] [Citation(s) in RCA: 66] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2015] [Accepted: 09/22/2015] [Indexed: 11/19/2022]  Open
10
Reinforcement-learning based dialogue system for human–robot interactions with socially-inspired rewards. COMPUT SPEECH LANG 2015. [DOI: 10.1016/j.csl.2015.03.007] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
11
Crook PA, Keizer S, Wang Z, Tang W, Lemon O. Real user evaluation of a POMDP spoken dialogue system using automatic belief compression. COMPUT SPEECH LANG 2014. [DOI: 10.1016/j.csl.2013.12.002] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
12
Geist M, Pietquin O. Algorithmic survey of parametric value function approximation. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2013;24:845-867. [PMID: 24808468 DOI: 10.1109/tnnls.2013.2247418] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
PrevPage 1 of 1 1Next
© 2004-2024 Baishideng Publishing Group Inc. All rights reserved. 7041 Koll Center Parkway, Suite 160, Pleasanton, CA 94566, USA