• Reference Citation Analysis
  • v
  • v
  • Find an Article
Find an Article PDF (4699682)   Today's Articles (7052)
For: Li L, Li D, Song T, Xu X. Actor-Critic Learning Control Based on -Regularized Temporal-Difference Prediction With Gradient Correction. IEEE Trans Neural Netw Learn Syst 2018;29:5899-5909. [PMID: 29993664 DOI: 10.1109/tnnls.2018.2808203] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Number Cited by Other Article(s)
1
Shi J, Chu C, Fan G, Hu D, Liu J, Wang Z, Hu S. Payoff Control in Multichannel Games: Influencing Opponent Learning Evolution. IEEE TRANSACTIONS ON CYBERNETICS 2025;PP:776-785. [PMID: 40030871 DOI: 10.1109/tcyb.2024.3507830] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/05/2025]
2
Yao L, Zhao B, Xu X, Wang Z, Wong PK, Hu Y. Efficient Incremental Offline Reinforcement Learning With Sparse Broad Critic Approximation. IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS: SYSTEMS 2024;54:156-169. [DOI: 10.1109/tsmc.2023.3305498] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/30/2024]
3
Ren J, Lan Y, Xu X, Zhang Y, Fang Q, Zeng Y. Deep reinforcement learning using least‐squares truncated temporal‐difference. CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY 2023. [DOI: 10.1049/cit2.12202] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/18/2023]  Open
4
Sparse online maximum entropy inverse reinforcement learning via proximal optimization and truncated gradient. Knowl Based Syst 2022. [DOI: 10.1016/j.knosys.2022.109443] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
5
Li L, Li D, Song T, Xu X. Actor-Critic Learning Control With Regularization and Feature Selection in Policy Gradient Estimation. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2021;32:1217-1227. [PMID: 32324571 DOI: 10.1109/tnnls.2020.2981377] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
6
Labao AB, Martija MAM, Naval PC. A3C-GS: Adaptive Moment Gradient Sharing With Locks for Asynchronous Actor-Critic Agents. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2021;32:1162-1176. [PMID: 32287019 DOI: 10.1109/tnnls.2020.2980743] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
7
A Style-Specific Music Composition Neural Network. Neural Process Lett 2020. [DOI: 10.1007/s11063-020-10241-8] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
PrevPage 1 of 1 1Next
© 2004-2025 Baishideng Publishing Group Inc. All rights reserved. 7041 Koll Center Parkway, Suite 160, Pleasanton, CA 94566, USA