Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Li L, Li D, Song T, Xu X. Actor-Critic Learning Control Based on -Regularized Temporal-Difference Prediction With Gradient Correction. IEEE Trans Neural Netw Learn Syst 2018;29:5899-5909. [PMID: 29993664 DOI: 10.1109/tnnls.2018.2808203] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]

For:	Li L, Li D, Song T, Xu X. Actor-Critic Learning Control Based on -Regularized Temporal-Difference Prediction With Gradient Correction. IEEE Trans Neural Netw Learn Syst 2018;29:5899-5909. [PMID: 29993664 DOI: 10.1109/tnnls.2018.2808203] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]

Number

Cited by Other Article(s)

Shi J, Chu C, Fan G, Hu D, Liu J, Wang Z, Hu S. Payoff Control in Multichannel Games: Influencing Opponent Learning Evolution. IEEE TRANSACTIONS ON CYBERNETICS 2025;PP:776-785. [PMID: 40030871 DOI: 10.1109/tcyb.2024.3507830] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/05/2025]

Yao L, Zhao B, Xu X, Wang Z, Wong PK, Hu Y. Efficient Incremental Offline Reinforcement Learning With Sparse Broad Critic Approximation. IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS: SYSTEMS 2024;54:156-169. [DOI: 10.1109/tsmc.2023.3305498] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/30/2024]

Ren J, Lan Y, Xu X, Zhang Y, Fang Q, Zeng Y. Deep reinforcement learning using least‐squares truncated temporal‐difference. CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY 2023. [DOI: 10.1049/cit2.12202] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/18/2023] Open

Sparse online maximum entropy inverse reinforcement learning via proximal optimization and truncated gradient. Knowl Based Syst 2022. [DOI: 10.1016/j.knosys.2022.109443] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]

Li L, Li D, Song T, Xu X. Actor-Critic Learning Control With Regularization and Feature Selection in Policy Gradient Estimation. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2021;32:1217-1227. [PMID: 32324571 DOI: 10.1109/tnnls.2020.2981377] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]

Labao AB, Martija MAM, Naval PC. A3C-GS: Adaptive Moment Gradient Sharing With Locks for Asynchronous Actor-Critic Agents. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2021;32:1162-1176. [PMID: 32287019 DOI: 10.1109/tnnls.2020.2980743] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]

A Style-Specific Music Composition Neural Network. Neural Process Lett 2020. [DOI: 10.1007/s11063-020-10241-8] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]