1
High-accuracy model-based reinforcement learning, a survey. Artif Intell Rev 2023. [DOI: 10.1007/s10462-022-10335-w]
2
Zhang J, Liu Q, Han X. Dynamic sub-route-based self-adaptive beam search Q-learning algorithm for traveling salesman problem. PLoS One 2023; 18:e0283207. [PMID: 36943840 PMCID: PMC10030033 DOI: 10.1371/journal.pone.0283207]
Abstract
In this paper, a dynamic sub-route-based self-adaptive beam search Q-learning (DSRABSQL) algorithm is proposed that provides a reinforcement learning (RL) framework combined with local search to solve the traveling salesman problem (TSP). DSRABSQL builds upon the Q-learning (QL) algorithm. To address QL's slow convergence and low accuracy, four strategies are first designed within the QL framework: a weighting-function-based reward matrix, a power-function-based initial Q-table, a self-adaptive ε-beam search strategy, and a new Q-value update formula, yielding the self-adaptive beam search Q-learning (ABSQL) algorithm. Because the sub-route is not fully optimized in ABSQL, a dynamic sub-route optimization strategy is then introduced outside the QL framework, yielding DSRABSQL. Experiments compare QL, ABSQL, DSRABSQL, our previously proposed variable neighborhood discrete whale optimization algorithm, and two advanced reinforcement learning algorithms. The experimental results show that DSRABSQL significantly outperforms the other algorithms. In addition, two groups of algorithms based on QL and DSRABSQL are designed to test the effectiveness of the five strategies. The results show that the dynamic sub-route optimization strategy and the self-adaptive ε-beam search strategy contribute the most on small-, medium-, and large-scale instances, and that the four strategies within the QL framework act synergistically, with the synergy growing as the instance scale expands.
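As a point of reference for the QL core that DSRABSQL extends, the sketch below shows plain tabular Q-learning for the TSP (state = current city, action = next unvisited city) with ordinary ε-greedy selection. It is a minimal baseline under assumed hyperparameters, not the authors' algorithm: the paper's four strategies are only flagged in comments.

```python
import numpy as np

def q_learning_tsp(dist, episodes=2000, alpha=0.1, gamma=0.9, eps=0.2, seed=0):
    """Plain tabular Q-learning for the TSP. A minimal baseline sketch;
    DSRABSQL replaces the zero Q-init (power-function init), the raw
    -distance reward (weighted reward matrix), the eps-greedy choice
    (self-adaptive eps-beam search), and this update rule, and adds
    dynamic sub-route optimization outside the QL framework."""
    dist = np.asarray(dist, dtype=float)
    rng = np.random.default_rng(seed)
    n = len(dist)
    Q = np.zeros((n, n))
    best_tour, best_len = None, np.inf
    for _ in range(episodes):
        start = int(rng.integers(n))
        tour, unvisited = [start], set(range(n)) - {start}
        while unvisited:
            s, cand = tour[-1], list(unvisited)
            if rng.random() < eps:                       # explore
                a = cand[rng.integers(len(cand))]
            else:                                        # exploit best known edge
                a = max(cand, key=lambda c: Q[s, c])
            r = -dist[s, a]                              # shorter edge -> larger reward
            future = max((Q[a, c] for c in unvisited if c != a), default=0.0)
            Q[s, a] += alpha * (r + gamma * future - Q[s, a])
            tour.append(a)
            unvisited.discard(a)
        length = sum(dist[tour[i], tour[(i + 1) % n]] for i in range(n))
        if length < best_len:
            best_len, best_tour = length, tour
    return best_tour, best_len
```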
Affiliation(s)
- Jin Zhang
- School of Computer and Information Engineering, Henan University, Kaifeng, Henan, China
- Henan Key Laboratory of Big Data Analysis and Processing, Henan University, Kaifeng, Henan, China
- Qing Liu
- School of Computer and Information Engineering, Henan University, Kaifeng, Henan, China
- XiaoHang Han
- School of Computer and Information Engineering, Henan University, Kaifeng, Henan, China
3
Abstract
Monte Carlo Tree Search (MCTS) is a powerful approach to designing game-playing bots or solving sequential decision problems. The method relies on intelligent tree search that balances exploration and exploitation. MCTS performs random sampling in the form of simulations and stores statistics of actions to make more educated choices in each subsequent iteration. The method has become a state-of-the-art technique for combinatorial games. However, in more complex games (e.g. those with a high branching factor or real-time ones), as well as in various practical domains (e.g. transportation, scheduling or security), an efficient MCTS application often requires its problem-dependent modification or integration with other techniques. Such domain-specific modifications and hybrid approaches are the main focus of this survey. The last major MCTS survey was published in 2012. Contributions that appeared since its release are of particular interest for this review.
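For readers new to the method, the baseline selection rule in MCTS is UCB1 applied to trees (UCT). A minimal sketch of that rule, assuming the usual per-child (total value, visit count) bookkeeping:

```python
import math

def uct_select(children, c=math.sqrt(2)):
    """UCB1 applied to trees (UCT): pick the child maximizing mean value plus
    an exploration bonus that shrinks with repeated visits.
    `children` maps action -> (total_value, visit_count)."""
    parent_visits = sum(n for _, n in children.values())
    def score(stats):
        value, visits = stats
        if visits == 0:
            return math.inf                  # force one visit of every child first
        return value / visits + c * math.sqrt(math.log(parent_visits) / visits)
    return max(children, key=lambda a: score(children[a]))

# e.g. uct_select({"expand": (3.0, 5), "attack": (2.5, 3), "scout": (0.0, 0)}) -> "scout"
```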
4
Li B. Hierarchical Architecture for Multi-Agent Reinforcement Learning in Intelligent Game. 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) 2022. [DOI: 10.1109/ijcnn55064.2022.9892666]
Affiliation(s)
- Bin Li
- Nanjing University, Department of Control and Systems Engineering, Nanjing, China
5
Probabilistic Plan Recognition for Multi-Agent Systems under Temporal Logic Tasks. ELECTRONICS 2022. [DOI: 10.3390/electronics11091352]
Abstract
This paper studies the plan recognition problem for multi-agent systems with temporal logic tasks, where the high-level temporal tasks are represented as linear temporal logic (LTL). We present a probabilistic plan recognition algorithm that predicts future goals and identifies the temporal logic tasks of the agents based on observations of their states and actions. We build a plan library composed of Nondeterministic Büchi Automata to model the temporal logic tasks, and we propose a Boolean matrix generation algorithm to map the plan library to multi-agent trajectories along with a task recognition algorithm to parse the Boolean matrix. A probability calculation formula then yields the posterior goal probability distribution, and the cold-start situation of plan recognition is resolved using the Bayes formula. Finally, we validate the proposed algorithm via extensive comparative simulations.
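The posterior computation the abstract refers to is a standard Bayes update over candidate goals. A generic sketch, assuming per-task trajectory likelihoods are already available (the paper derives them from Büchi-automaton runs over the plan library):

```python
import numpy as np

def goal_posterior(prior, likelihoods):
    """Bayes update for probabilistic plan recognition: combine a prior over
    candidate temporal-logic tasks with per-task likelihoods of the observed
    states/actions to get the posterior goal distribution. If no task explains
    the observations (the cold-start case), fall back to a uniform distribution."""
    post = np.asarray(prior, dtype=float) * np.asarray(likelihoods, dtype=float)
    z = post.sum()
    if z == 0.0:
        return np.full(len(post), 1.0 / len(post))
    return post / z

# e.g. goal_posterior([0.5, 0.3, 0.2], [0.01, 0.20, 0.05]) -> array([0.067, 0.800, 0.133])
```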
6
Xie D, Zhong X. Semicentralized Deep Deterministic Policy Gradient in Cooperative StarCraft Games. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2022; 33:1584-1593. [PMID: 33351767 DOI: 10.1109/tnnls.2020.3042943]
Abstract
In this article, we propose a novel semicentralized deep deterministic policy gradient (SCDDPG) algorithm for cooperative multiagent games. Specifically, we design a two-level actor-critic structure to help the agents interact and cooperate in StarCraft combat. A local actor-critic structure is established for each kind of agent with partially observable information received from the environment. A global actor-critic structure is then built to provide the local design with an overall view of the combat based on limited centralized information, such as the health value. These two structures work together to generate the optimal control action for each agent and to achieve better cooperation in the games. Compared with fully centralized methods, this design reduces the communication burden by sending only limited information to the global level during the learning process. Furthermore, reward functions are designed for both the local and global structures based on the agents' attributes to further improve learning performance in the stochastic environment. The developed method has been demonstrated on several scenarios in a real-time strategy game, i.e., StarCraft. The simulation results show that the agents can effectively cooperate with their teammates and defeat the enemies in various StarCraft scenarios.
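A schematic of the two-level idea, written as an assumed actor network rather than the authors' architecture: a compact global feature (e.g., team health) is encoded from limited centralized information and appended to each agent's partial observation before the local policy acts.

```python
import torch
import torch.nn as nn

class TwoLevelActor(nn.Module):
    """Semicentralized actor in the spirit of the abstract: a small global
    feature, encoded from limited centralized information, is appended to each
    agent's partial observation before the local policy acts. Layer sizes and
    the choice of global signal are assumptions here."""
    def __init__(self, local_dim, global_dim, n_actions):
        super().__init__()
        self.global_enc = nn.Sequential(nn.Linear(global_dim, 8), nn.ReLU())
        self.local_pi = nn.Sequential(
            nn.Linear(local_dim + 8, 64), nn.ReLU(), nn.Linear(64, n_actions))

    def forward(self, local_obs, global_info):
        g = self.global_enc(global_info)                          # compact "overall view"
        return self.local_pi(torch.cat([local_obs, g], dim=-1))  # per-agent action logits

# actor = TwoLevelActor(local_dim=20, global_dim=4, n_actions=6)
# logits = actor(torch.randn(3, 20), torch.randn(3, 4))  # 3 agents, shared weights
```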
7
Khanna R, Dodge J, Anderson A, Dikkala R, Irvine J, Shureih Z, Lam KH, Matthews CR, Lin Z, Kahng M, Fern A, Burnett M. Finding AI's Faults with AAR/AI: An Empirical Study. ACM T INTERACT INTEL 2022. [DOI: 10.1145/3487065]
Abstract
Would you allow an AI agent to make decisions on your behalf? If the answer is “not always,” the next question becomes “in what circumstances”? Answering this question requires human users to be able to assess an AI agent, and not just with overall pass/fail assessments or statistics. Here users need to be able to localize an agent's bugs so that they can determine when they are willing to rely on the agent and when they are not. After-Action Review for AI (AAR/AI), a new AI assessment process for integration with Explainable AI systems, aims to support human users in this endeavor, and in this article we empirically investigate AAR/AI's effectiveness with domain-knowledgeable users. Our results show that AAR/AI participants not only located significantly more bugs than non-AAR/AI participants did (i.e., showed greater recall) but also located them more precisely (i.e., with greater precision). In fact, AAR/AI participants outperformed non-AAR/AI participants on every bug and were, on average, almost six times as likely as non-AAR/AI participants to find any particular bug. Finally, evidence suggests that incorporating labeling into the AAR/AI process may encourage domain-knowledgeable users to abstract above individual instances of bugs; we hypothesize that doing so may have contributed further to AAR/AI participants' effectiveness.
Affiliation(s)
- Jed Irvine
- Oregon State University, Corvallis, OR, USA
- Kin-Ho Lam
- Oregon State University, Corvallis, OR, USA
- Alan Fern
- Oregon State University, Corvallis, OR, USA
8
Ye D, Chen G, Zhao P, Qiu F, Yuan B, Zhang W, Chen S, Sun M, Li X, Li S, Liang J, Lian Z, Shi B, Wang L, Shi T, Fu Q, Yang W, Huang L. Supervised Learning Achieves Human-Level Performance in MOBA Games: A Case Study of Honor of Kings. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2022; 33:908-918. [PMID: 33147150 DOI: 10.1109/tnnls.2020.3029475]
Abstract
We present JueWu-SL, the first supervised-learning-based artificial intelligence (AI) program that achieves human-level performance in playing multiplayer online battle arena (MOBA) games. Unlike prior attempts, we integrate the macro-strategy and the micromanagement of MOBA-game-playing into neural networks in a supervised and end-to-end manner. Tested on Honor of Kings, the most popular MOBA at present, our AI performs competitively at the level of High King players in standard 5v5 games.
9
Liu X, Tan Y. Attentive Relational State Representation in Decentralized Multiagent Reinforcement Learning. IEEE TRANSACTIONS ON CYBERNETICS 2022; 52:252-264. [PMID: 32224477 DOI: 10.1109/tcyb.2020.2979803]
Abstract
In multiagent reinforcement learning (MARL), it is crucial for each agent to model its relation with its neighbors. Existing approaches usually resort to concatenating the features of multiple neighbors, which fixes both the size and the identity of the inputs; such settings are inflexible and unscalable. In this article, we propose an attentive relational encoder (ARE), a novel scalable feedforward neural module that attentionally aggregates an arbitrary-sized neighboring feature set for state representation in decentralized MARL. The ARE actively selects the relevant information from the neighboring agents and is permutation invariant, computationally efficient, and flexible to interactive multiagent systems. Our method consistently outperforms the latest competing decentralized MARL methods in several multiagent tasks. In particular, it shows strong cooperative performance in challenging StarCraft micromanagement tasks and achieves over a 96% win rate against the most difficult noncheating built-in artificial intelligence bots.
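The core mechanism, attention-weighted aggregation over a variable-sized neighbor set, can be sketched as follows. This is an illustrative module under assumed layer shapes, not the authors' exact ARE: the agent's own feature queries the neighbor features, and a softmax-weighted sum gives a fixed-size, permutation-invariant summary.

```python
import torch
import torch.nn as nn

class AttentivePool(nn.Module):
    """Permutation-invariant attention over a variable number of neighbor
    features: the agent's own feature queries the neighbor set, and a
    softmax-weighted sum yields a fixed-size state summary regardless of how
    many neighbors are present or in what order they arrive."""
    def __init__(self, d_self, d_neigh, d_out):
        super().__init__()
        self.q = nn.Linear(d_self, d_out)
        self.k = nn.Linear(d_neigh, d_out)
        self.v = nn.Linear(d_neigh, d_out)

    def forward(self, self_feat, neigh_feats):
        # self_feat: (d_self,); neigh_feats: (n_neighbors, d_neigh), n may vary
        q = self.q(self_feat)                                    # (d_out,)
        k, v = self.k(neigh_feats), self.v(neigh_feats)          # (n, d_out)
        attn = torch.softmax(k @ q / q.shape[0] ** 0.5, dim=0)   # (n,) weights
        return attn @ v                                          # (d_out,)

# pool = AttentivePool(d_self=8, d_neigh=6, d_out=16)
# s = pool(torch.randn(8), torch.randn(5, 6))   # works for any neighbor count
```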
10
Dodge J, Khanna R, Irvine J, Lam KH, Mai T, Lin Z, Kiddle N, Newman E, Anderson A, Raja S, Matthews C, Perdriau C, Burnett M, Fern A. After-Action Review for AI (AAR/AI). ACM T INTERACT INTEL 2021. [DOI: 10.1145/3453173]
Abstract
Explainable AI is growing in importance as AI pervades modern society, but few have studied how explainable AI can directly support people trying to assess an AI agent. Without a rigorous process, people may approach assessment in ad hoc ways, leading to the possibility of wide variations in assessment of the same agent due only to variations in their processes. AAR, or After-Action Review, is a method some military organizations use to assess human agents, and it has been validated in many domains. Drawing upon this strategy, we derived an After-Action Review for AI (AAR/AI) to organize the ways people assess reinforcement learning agents in a sequential decision-making environment. We then investigated what AAR/AI brought to human assessors in two qualitative studies. The first investigated AAR/AI to gather formative information, and the second built upon those results and also varied the type of explanation (model-free vs. model-based) used in the AAR/AI process. Among the results were the following: (1) participants reported that AAR/AI helped them organize their thoughts and think logically about the agent, (2) AAR/AI encouraged participants to reason about the agent from a wide range of perspectives, and (3) participants were able to leverage AAR/AI with the model-based explanations to falsify the agent's predictions.
Affiliation(s)
- Sai Raja
- Oregon State University, Corvallis, OR
- Alan Fern
- Oregon State University, Corvallis, OR
Collapse
|
11
Huang W, Yin Q, Zhang J, Huang K. Learning Macromanagement in StarCraft by Deep Reinforcement Learning. SENSORS 2021; 21:3332. [PMID: 34065012 PMCID: PMC8150573 DOI: 10.3390/s21103332]
Abstract
StarCraft is a real-time strategy game that provides a complex environment for AI research. Macromanagement, i.e., selecting appropriate units to build depending on the current state, is one of the most important problems in this game. To reduce the requirements for expert knowledge and enhance the coordination of the systematic bot, we use reinforcement learning (RL) to tackle the problem of macromanagement. We propose a novel deep RL method, Mean Asynchronous Advantage Actor-Critic (MA3C), which computes the approximate expected policy gradient instead of the gradient of a sampled action to reduce the variance of the gradient, and encodes the history queue with a recurrent neural network to tackle the problem of imperfect information. The experimental results show that MA3C achieves a very high win rate of approximately 90% against the weaker opponents and improves the win rate by about 30% against the stronger opponents. We also propose a novel method to visualize and interpret the policy learned by MA3C. Combining the visualized results with snapshots of games, we find that the learned macromanagement not only adapts to the game rules and the policy of the opponent bot but also cooperates well with the other modules of MA3C-Bot.
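The variance-reduction idea the abstract names, taking the gradient over the full action distribution rather than one sampled action, can be sketched for a discrete actor as below. A generic sketch with assumed tensor shapes; `q_values` stands in for the critic's advantage estimates.

```python
import torch

def expected_pg_loss(logits, q_values):
    """Actor loss whose gradient is the expected policy gradient
    sum_a pi(a|s) * grad log pi(a|s) * Q(s,a), i.e., an average over the whole
    discrete action distribution instead of the usual single sampled action.
    Averaging out the action sampling removes that source of variance.
    Both arguments are assumed to have shape (batch, n_actions)."""
    probs = torch.softmax(logits, dim=-1)
    log_probs = torch.log_softmax(logits, dim=-1)
    # detach the weights so only grad log pi flows back through the actor
    return -(probs.detach() * log_probs * q_values.detach()).sum(-1).mean()
```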
Affiliation(s)
- Wenzhen Huang
- School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing 100049, China
- CRISE, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China
- Qiyue Yin
- School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing 100049, China
- CRISE, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China
- Junge Zhang
- School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing 100049, China
- CRISE, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China
- Kaiqi Huang
- School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing 100049, China
- CRISE, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China
- CAS Center for Excellence in Brain Science and Intelligence Technology, Beijing 100190, China
12
Cuccu G, Togelius J, Cudré-Mauroux P. Playing Atari with few neurons: Improving the efficacy of reinforcement learning by decoupling feature extraction and decision making. AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS 2021; 35:17. [PMID: 34720684 PMCID: PMC8550197 DOI: 10.1007/s10458-021-09497-8]
Abstract
We propose a new method for learning compact state representations and policies separately but simultaneously for policy approximation in vision-based applications such as Atari games. Approaches based on deep reinforcement learning typically map pixels directly to actions to enable end-to-end training. Internally, however, the deep neural network bears the responsibility of both extracting useful information and making decisions based on it, two objectives that can be addressed independently. Separating the image processing from the action selection allows for a better understanding of each task individually, as well as potentially finding smaller policy representations, which is inherently interesting. Our approach learns state representations using a compact encoder based on two novel algorithms: (i) Increasing Dictionary Vector Quantization builds a dictionary of state representations that grows in size over time, allowing our method to address new observations as they appear in an open-ended online-learning context; and (ii) Direct Residuals Sparse Coding encodes observations as a function of the dictionary, aiming for the highest information inclusion by disregarding reconstruction error and maximizing code sparsity. As the dictionary grows, however, the encoder produces increasingly larger inputs for the neural network; this issue is addressed with a new variant of the Exponential Natural Evolution Strategies algorithm that adapts the dimensionality of its probability distribution during the run. We test our system on a selection of Atari games using tiny neural networks of only 6 to 18 neurons (depending on each game's controls). These are still capable of achieving results that are not much worse than, and occasionally superior to, the state of the art in direct policy search, which uses two orders of magnitude more neurons.
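A simplified sketch of the dictionary-growth idea behind Increasing Dictionary Vector Quantization, under assumptions (greedy non-negative projections, a norm-based growth test) that stand in for the paper's exact encoding and its Direct Residuals Sparse Coding companion:

```python
import numpy as np

def idvq_encode(obs, dictionary, growth_threshold=0.1):
    """Toy version of the dictionary-growth idea: greedily explain `obs` with
    non-negative contributions of existing atoms, and if the leftover residual
    is still large relative to the observation, add it as a new atom so later
    observations can use it. The greedy projection and the norm test are
    illustrative stand-ins, not the paper's exact procedure."""
    residual = np.asarray(obs, dtype=float).copy()
    code = np.zeros(len(dictionary))
    for i, atom in enumerate(dictionary):
        w = residual @ atom / (atom @ atom + 1e-12)
        if w > 0:                                  # additive contributions only
            code[i] = w
            residual = residual - w * atom
    if np.linalg.norm(residual) > growth_threshold * np.linalg.norm(obs):
        dictionary.append(residual.copy())         # the dictionary grows over time
    return code, dictionary
```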
Affiliation(s)
- Giuseppe Cuccu
- eXascale Infolab, Department of Computer Science, University of Fribourg, Fribourg, Switzerland
- Julian Togelius
- Game Innovation Lab, Tandon School of Engineering, New York University, New York, NY, USA
- Philippe Cudré-Mauroux
- eXascale Infolab, Department of Computer Science, University of Fribourg, Fribourg, Switzerland
13
Penney S, Dodge J, Anderson A, Hilderbrand C, Simpson L, Burnett M. The Shoutcasters, the Game Enthusiasts, and the AI: Foraging for Explanations of Real-time Strategy Players. ACM T INTERACT INTEL 2021. [DOI: 10.1145/3396047]
Abstract
Assessing and understanding intelligent agents is a difficult task for users who lack an AI background. “Explainable AI” (XAI) aims to address this problem, but what should be in an explanation? One route toward answering this question is to turn to theories of how humans try to obtain information they seek. Information Foraging Theory (IFT) is one such theory. In this article, we present a series of studies using IFT: the first investigates how expert explainers supply explanations in the RTS domain, the second investigates what explanations domain experts demand from agents in the RTS domain, and the last focuses on how both populations try to explain a state-of-the-art AI. Our results show that RTS environments like StarCraft offer so many options, changing so rapidly, that foraging tends to be very costly. Foragers attempted to manage such costs with “satisficing” approaches that reduce cognitive load, such as focusing more on What information than on Why information, strategic use of language to communicate a lot of nuanced information in a few words, and optimizing their environment when possible to make their most valuable information patches readily available. Further, when a real AI entered the picture, even very experienced domain experts had difficulty understanding and judging some of the AI's unconventional behaviors. Finally, our results reveal ways Information Foraging Theory can inform future XAI interactive explanation environments, and also how XAI can inform IFT.
14
Zha Z, Wang B, Tang X. Evaluate, explain, and explore the state more exactly: an improved Actor-Critic algorithm for complex environment. Neural Comput Appl 2021. [DOI: 10.1007/s00521-020-05663-3]
15
Abstract
In general, games pose interesting and complex problems for the implementation of intelligent agents and are a popular domain in the study of artificial intelligence. In fact, games have been at the center of some of the most well-known achievements in artificial intelligence. From classical board games such as chess, checkers, backgammon and Go, to video games such as Dota 2 and StarCraft II, artificial intelligence research has devised computer programs that can play at the level of a human master and even at a human world champion level. Planning and learning, two well-known and successful paradigms of artificial intelligence, have greatly contributed to these achievements. Although representing distinct approaches, planning and learning try to solve similar problems and share some similarities. They can even complement each other. This has led to research on methodologies to combine the strengths of both approaches to derive better solutions. This paper presents a survey of the multiple methodologies that have been proposed to integrate planning and learning in the context of games. In order to provide a richer contextualization, the paper also presents learning and planning techniques commonly used in games, both in terms of their theoretical foundations and applications.
16
Anderson A, Dodge J, Sadarangani A, Juozapaitis Z, Newman E, Irvine J, Chattopadhyay S, Olson M, Fern A, Burnett M. Mental Models of Mere Mortals with Explanations of Reinforcement Learning. ACM T INTERACT INTEL 2020. [DOI: 10.1145/3366485]
Abstract
How should reinforcement learning (RL) agents explain themselves to humans not trained in AI? To gain insights into this question, we conducted a 124-participant, four-treatment experiment to compare participants' mental models of an RL agent in the context of a simple Real-Time Strategy (RTS) game. The four treatments isolated two types of explanations vs. neither vs. both together. The two types of explanations were as follows: (1) saliency maps (an “Input Intelligibility Type” that explains the AI's focus of attention) and (2) reward-decomposition bars (an “Output Intelligibility Type” that explains the AI's predictions of future types of rewards). Our results show that a combined explanation that included saliency and reward bars was needed to achieve a statistically significant difference in participants' mental model scores over the no-explanation treatment. However, this combined explanation was far from a panacea: It exacted disproportionately high cognitive loads from the participants who received the combined explanation. Further, in some situations, participants who saw both explanations predicted the agent's next action worse than all other treatments' participants.
Affiliation(s)
- Jonathan Dodge
- Oregon State University, SW Jefferson Way, Corvallis, OR
- Evan Newman
- Oregon State University, SW Jefferson Way, Corvallis, OR
- Jed Irvine
- Oregon State University, SW Jefferson Way, Corvallis, OR
- Matthew Olson
- Oregon State University, SW Jefferson Way, Corvallis, OR
- Alan Fern
- Oregon State University, SW Jefferson Way, Corvallis, OR
17
Badman RP, Hills TT, Akaishi R. Multiscale Computation and Dynamic Attention in Biological and Artificial Intelligence. Brain Sci 2020; 10:E396. [PMID: 32575758 PMCID: PMC7348831 DOI: 10.3390/brainsci10060396]
Abstract
Biological and artificial intelligence (AI) are often defined by their capacity to achieve a hierarchy of short-term and long-term goals that require incorporating information over time and space at both local and global scales. More advanced forms of this capacity involve the adaptive modulation of integration across scales, which resolves computational inefficiency and explore-exploit dilemmas at the same time. Research in both neuroscience and AI has made progress towards understanding architectures that achieve this. Insights into biological computations come from phenomena such as decision inertia, habit formation, information search, risky choices and foraging. Across these domains, the brain is equipped with mechanisms (such as the dorsal anterior cingulate and dorsolateral prefrontal cortex) that can represent and modulate across scales, both with top-down control processes and by local-to-global consolidation as information progresses from sensory to prefrontal areas. Paralleling these biological architectures, progress in AI is marked by innovations in dynamic multiscale modulation, moving from recurrent and convolutional neural networks (with fixed scalings) to attention, transformers, dynamic convolutions, and consciousness priors (which modulate scale to input and increase scale breadth). The use and development of these multiscale innovations in robotic agents, game AI, and natural language processing (NLP) are pushing the boundaries of AI achievements. By juxtaposing biological and artificial intelligence, the present work underscores the critical importance of multiscale processing to general intelligence, as well as highlighting innovations and differences between the futures of biological and artificial intelligence.
Affiliation(s)
- Rei Akaishi
- Center for Brain Science, RIKEN, Saitama 351-0198, Japan
18
Fea MP, Boisseau RP, Emlen DJ, Holwell GI. Cybernetic combatants support the importance of duels in the evolution of extreme weapons. Proc Biol Sci 2020; 287:20200254. [PMID: 32517625 DOI: 10.1098/rspb.2020.0254]
Abstract
A current evolutionary hypothesis predicts that the most extreme forms of animal weaponry arise in systems where combatants fight each other one-to-one, in duels. It has also been suggested that arms races in human interstate conflicts are more likely to escalate in cases where there are only two opponents. However, directly testing whether duels matter for weapon investment is difficult in animals and impossible in interstate conflicts. Here, we test whether superior combatants experience a disproportionate advantage in duels, as compared with multi-combatant skirmishes, in a system analogous to both animal and military contests: the battles fought by artificial intelligence agents in a computer war game. We found that combatants with experimentally improved fighting power had a large advantage in duels, but that this advantage deteriorated as the complexity of the battlefield was increased by the addition of further combatants. This pattern remained under the two different forms of the advantage granted to our focal artificial intelligence (AI) combatants, and became reversed when we switched the roles to feature a weak focal AI among strong opponents. Our results suggest that one-on-one combat may trigger arms races in diverse systems. These results corroborate the outcomes of studies of both animal and interstate contests, and suggest that elements of animal contest theory may be widely applicable to arms races generally.
Affiliation(s)
- Murray P Fea
- School of Biological Sciences, The University of Auckland, Auckland, New Zealand
- Romain P Boisseau
- Division of Biological Sciences, University of Montana, Missoula, MT 59812, USA
- Douglas J Emlen
- Division of Biological Sciences, University of Montana, Missoula, MT 59812, USA
- Gregory I Holwell
- School of Biological Sciences, The University of Auckland, Auckland, New Zealand
19
A Comparison of Evolutionary and Tree-Based Approaches for Game Feature Validation in Real-Time Strategy Games with a Novel Metric. MATHEMATICS 2020. [DOI: 10.3390/math8050688]
Abstract
When it comes to game playing, evolutionary and tree-based approaches are the most popular approximate methods for decision making in the artificial intelligence field of game research. The evolutionary domain draws its inspiration for the design of approximate methods from nature, while the tree-based domain builds an approximate representation of the world in a tree-like structure and then searches that tree for the optimal path. In this paper, we propose a novel metric for game feature validation in Real-Time Strategy (RTS) games. Firstly, Real-Time Strategy game features are identified and grouped; secondly, the groups are assigned to weighted classes according to their correlation and importance. The novel metric is based on the groups, the weighted classes, and how many times the playtesting agent invalidated the game feature in a given game feature scenario. The metric is used in a series of experiments involving recent state-of-the-art evolutionary and tree-based playtesting agents. The experiments revealed no major difference between evolutionary-based and tree-based playtesting agents.
20
A Confrontation Decision-Making Method with Deep Reinforcement Learning and Knowledge Transfer for Multi-Agent System. Symmetry (Basel) 2020. [DOI: 10.3390/sym12040631]
Abstract
In this paper, deep reinforcement learning (DRL) and knowledge transfer are used to achieve effective control of the learning agent for confrontation in multi-agent systems. Firstly, a multi-agent Deep Deterministic Policy Gradient (DDPG) algorithm with parameter sharing is proposed to achieve confrontation decision-making for multiple agents. During training, the information of other agents is introduced to the critic network to improve the confrontation strategy, and the parameter-sharing mechanism reduces the cost of experience storage. In the DDPG algorithm, we use four neural networks to generate real-time actions and Q-value estimates, and a momentum mechanism to optimize the training process and accelerate the convergence of the neural networks. Secondly, this paper introduces an auxiliary controller using a policy-based reinforcement learning (RL) method to provide assistant decision-making for the game agent, and an effective reward function helps agents balance the losses of enemy and friendly units. Furthermore, this paper also uses the knowledge transfer method to extend the learning model to more complex scenes and improve the generalization of the proposed confrontation model. Two confrontation decision-making experiments are designed to verify the effectiveness of the proposed method. In a small-scale task scenario, the trained agent successfully learns to fight the competitors and achieves a good win rate. For large-scale confrontation scenarios, the knowledge transfer method gradually improves the decision-making level of the learning agent.
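The centralized-critic-with-parameter-sharing idea can be sketched as one critic, shared by all agents, that sees every agent's observation and action during training. Layer sizes and input layout are assumptions, not the authors' network:

```python
import torch
import torch.nn as nn

class SharedCritic(nn.Module):
    """One critic, shared by all agents, that conditions on every agent's
    observation and action during training (the other-agent information the
    abstract mentions) while keeping a single parameter set across agents."""
    def __init__(self, obs_dim, act_dim, n_agents):
        super().__init__()
        joint = n_agents * (obs_dim + act_dim)        # centralized critic input
        self.net = nn.Sequential(nn.Linear(joint, 128), nn.ReLU(), nn.Linear(128, 1))

    def forward(self, all_obs, all_acts):
        # all_obs: (batch, n_agents, obs_dim); all_acts: (batch, n_agents, act_dim)
        x = torch.cat([all_obs.flatten(1), all_acts.flatten(1)], dim=-1)
        return self.net(x)                             # joint-behavior Q-value
```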
21
From Chess and Atari to StarCraft and Beyond: How Game AI is Driving the World of AI. KÜNSTLICHE INTELLIGENZ 2020. [DOI: 10.1007/s13218-020-00647-w]
22
Fuzzy Reinforcement Learning and Curriculum Transfer Learning for Micromanagement in Multi-Robot Confrontation. INFORMATION 2019. [DOI: 10.3390/info10110341]
Abstract
Multi-robot confrontation on physics-based simulators is a complex and time-consuming task, but simulators are required to evaluate the performance of advanced algorithms. Recently, a few advanced algorithms have been able to handle considerably complex levels of the robot confrontation system when the agents face multiple opponents, yet current confrontation decision-making systems suffer from difficulties in optimization and generalization. In this paper, fuzzy reinforcement learning (RL) and curriculum transfer learning are applied to micromanagement for the robot confrontation system. Firstly, an improved Q-learning in the semi-Markov decision process is designed to train the agent, and an efficient RL model is defined to avoid the curse of dimensionality. Secondly, a multi-agent RL algorithm with parameter sharing is proposed to train the agents. We use a neural network with adaptive momentum acceleration as a function approximator to estimate the state-action function, and fuzzy logic is then used to regulate the learning rate of RL. Thirdly, a curriculum transfer learning method is used to extend the RL model to more difficult scenarios, which ensures the generalization of the decision-making system. The experimental results show that the proposed method is effective.
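The fuzzy learning-rate regulation can be illustrated with a toy two-rule controller: a large temporal-difference error pushes the rate up, a small one pushes it down. The membership functions and bounds here are assumptions; the paper's rule base is not reproduced.

```python
def fuzzy_learning_rate(td_error, lr_min=0.01, lr_max=0.5):
    """Two-rule fuzzy regulation of the learning rate: a large TD error means
    the value estimate is unreliable, so learn faster; a small TD error means
    near convergence, so learn slower. Triangular memberships and a weighted
    mean defuzzification; the bounds are illustrative."""
    e = min(abs(td_error), 1.0)          # normalized error magnitude in [0, 1]
    small, large = 1.0 - e, e            # membership in "small" / "large"
    return (small * lr_min + large * lr_max) / (small + large)
```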
23
Shao K, Zhu Y, Zhao D. StarCraft Micromanagement With Reinforcement Learning and Curriculum Transfer Learning. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE 2019. [DOI: 10.1109/tetci.2018.2823329]
24
Pronobis W, Tkatchenko A, Müller KR. Many-Body Descriptors for Predicting Molecular Properties with Machine Learning: Analysis of Pairwise and Three-Body Interactions in Molecules. J Chem Theory Comput 2018; 14:2991-3003. [DOI: 10.1021/acs.jctc.8b00110]
Affiliation(s)
- Wiktor Pronobis
- Machine Learning Group, Technische Universität Berlin, 10587 Berlin, Germany
- Alexandre Tkatchenko
- Physics and Materials Science Research Unit, University of Luxembourg, Luxembourg L-1511, Luxembourg
- Klaus-Robert Müller
- Machine Learning Group, Technische Universität Berlin, 10587 Berlin, Germany
- Max Planck Institute for Informatics, 66123 Saarbrücken, Germany
- Department of Brain and Cognitive Engineering, Korea University, Seoul 136-713, South Korea
25
Procedural generation of non-player characters in massively multiplayer online strategy games. Soft comput 2017. [DOI: 10.1007/s00500-016-2238-3]
26
Modified Adversarial Hierarchical Task Network Planning in Real-Time Strategy Games. APPLIED SCIENCES-BASEL 2017. [DOI: 10.3390/app7090872]
27
Bosc G, Tan P, Boulicaut JF, Raissi C, Kaytoue M. A Pattern Mining Approach to Study Strategy Balance in RTS Games. IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES 2017. [DOI: 10.1109/tciaig.2015.2511819]
28
29
Synnaeve G, Bessiere P. Multiscale Bayesian Modeling for RTS Games: An Application to StarCraft AI. IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES 2016. [DOI: 10.1109/tciaig.2015.2487743]
30
Liu S, Louis SJ, Ballinger CA. Evolving Effective Microbehaviors in Real-Time Strategy Games. IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES 2016. [DOI: 10.1109/tciaig.2016.2544844]
31
Ballinger C, Louis S, Liu S. Coevolving Robust Build-Order Iterative Lists for Real-Time Strategy Games. IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES 2016. [DOI: 10.1109/tciaig.2016.2544817]
32
GHOST: A Combinatorial Optimization Framework for Real-Time Problems. IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES 2016. [DOI: 10.1109/tciaig.2016.2573199]
33
Perez-Liebana D, Samothrakis S, Togelius J, Schaul T, Lucas SM, Couetoux A, Lee J, Lim CU, Thompson T. The 2014 General Video Game Playing Competition. IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES 2016. [DOI: 10.1109/tciaig.2015.2402393]
34
35
Togelius J. How to Run a Successful Game-Based AI Competition. IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES 2016. [DOI: 10.1109/tciaig.2014.2365470]
36
Stanescu M, Certicky M. Predicting Opponent's Production in Real-Time Strategy Games With Answer Set Programming. IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES 2016. [DOI: 10.1109/tciaig.2014.2365414]
37
Yannakakis GN, Togelius J. A Panorama of Artificial and Computational Intelligence in Games. IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES 2015. [DOI: 10.1109/tciaig.2014.2339221]