• Reference Citation Analysis
For: Vien NA, Ertel W, Chung TC. Learning via human feedback in continuous state and action spaces. APPL INTELL 2013. [DOI: 10.1007/s10489-012-0412-6] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Indexed: 10/27/2022]
Number Cited by Other Article(s)
1. Mourad N, Ezzeddine A, Nadjar Araabi B, Nili Ahmadabadi M. Learning from Demonstrations and Human Evaluative Feedbacks: Handling Sparsity and Imperfection Using Inverse Reinforcement Learning Approach. Journal of Robotics 2020;2020:1-18. [DOI: 10.1155/2020/3849309] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Indexed: 11/18/2022]
2. Zhao X, Ding S, An Y, Jia W. Applications of asynchronous deep reinforcement learning based on dynamic updating weights. APPL INTELL 2018. [DOI: 10.1007/s10489-018-1296-x] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Indexed: 02/04/2023]
3. Celemin C, Ruiz-del-Solar J. An Interactive Framework for Learning Continuous Actions Policies Based on Corrective Feedback. J INTELL ROBOT SYST 2019;95:77-97. [DOI: 10.1007/s10846-018-0839-z] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Indexed: 10/16/2022]
4. Jagodnik KM, Thomas PS, van den Bogert AJ, Branicky MS, Kirsch RF. Training an Actor-Critic Reinforcement Learning Controller for Arm Movement Using Human-Generated Rewards. IEEE Trans Neural Syst Rehabil Eng 2017;25:1892-1905. [PMID: 28475063 DOI: 10.1109/tnsre.2017.2700395] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.0] [Indexed: 11/10/2022]
5. Vien NA, Lee S, Chung T. Bayes-adaptive hierarchical MDPs. APPL INTELL 2016. [DOI: 10.1007/s10489-015-0742-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Indexed: 10/22/2022]
6. Ngo H, Luciw M, Nagi J, Forster A, Schmidhuber J, Vien NA. Efficient Interactive Multiclass Learning from Binary Feedback. ACM T INTERACT INTEL 2014. [DOI: 10.1145/2629631] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Indexed: 10/24/2022]
7. Kusy M, Zajdel R. Probabilistic neural network training procedure based on Q(0)-learning algorithm in medical data classification. APPL INTELL 2014;41:837-54. [DOI: 10.1007/s10489-014-0562-9] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.8] [Indexed: 11/26/2022]
8. Vien NA, Ngo H, Lee S, Chung T. Approximate planning for bayesian hierarchical reinforcement learning. APPL INTELL 2014;41:808-19. [DOI: 10.1007/s10489-014-0565-6] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Indexed: 11/26/2022]
9. Wu B, Zheng HY, Feng YP. Point-based online value iteration algorithm in large POMDP. APPL INTELL 2013. [DOI: 10.1007/s10489-013-0479-8] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Indexed: 10/26/2022]
10. Abdoos M, Mozayani N, Bazzan ALC. Hierarchical control of traffic signals using Q-learning with tile coding. APPL INTELL 2013. [DOI: 10.1007/s10489-013-0455-3] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.4] [Indexed: 10/26/2022]