Abstract:StarCraft II is a challenging benchmark for AI agents due to the necessity of both precise micro level operations and strategic macro awareness. Previous works, such as Alphastar and SCC, achieve impressive performance on tackling StarCraft II , however, still exhibit deficiencies in long term strategic planning and strategy interpretability. Emerging large language model (LLM) agents, such as Voyage and MetaGPT, presents the immense potential in solving intricate tasks. Motivated by this, we aim to validate the capabilities of LLMs on StarCraft II, a highly complex RTS <a class="link-external link-http" href="http://game.To" rel="external noopener nofollow">this http URL</a> conveniently take full advantage of LLMs` reasoning abilities, we first develop textual StratCraft II environment, called TextStarCraft II, which LLM agent can interact. Secondly, we propose a Chain of Summarization method, including single frame summarization for processing raw observations and multi frame summarization for analyzing game information, providing command recommendations, and generating strategic decisions. Our experiment consists of two parts: first, an evaluation by human experts, which includes assessing the LLMs`s mastery of StarCraft II knowledge and the performance of LLM agents in the game; second, the in game performance of LLM agents, encompassing aspects like win rate and the impact of Chain of Summarization.Experiment results demonstrate that: 1. LLMs possess the relevant knowledge and complex planning abilities needed to address StarCraft II scenarios; 2. Human experts consider the performance of LLM agents to be close to that of an average player who has played StarCraft II for eight years; 3. LLM agents are capable of defeating the built in AI at the Harder(Lv5) difficulty level. We have open sourced the code and released demo videos of LLM agent playing StarCraft II.

Learning cooperative strategies in StarCraft through role-based monotonic value function factorization

StarCraft Micromanagement with Reinforcement Learning and Curriculum Transfer Learning

Value Functions Factorization with Latent State Information Sharing in Decentralized Multi-Agent Policy Gradients

Learning Macromanagement in Starcraft by Deep Reinforcement Learning

QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning

Knowledge-Guided Agent-Tactic-Aware Learning for StarCraft Micromanagement

Towards Understanding Cooperative Multi-Agent Q-Learning with Value Factorization.

RODE: Learning Roles to Decompose Multi-Agent Tasks

SCC: an efficient deep reinforcement learning agent mastering the game of StarCraft II

A Hierarchical Model for StarCraft II Mini-Game

Rethinking the Implementation Tricks and Monotonicity Constraint in Cooperative Multi-Agent Reinforcement Learning

Large Language Models Play StarCraft II: Benchmarks and A Chain of Summarization Approach

Learning Macromanagement in StarCraft from Replays using Deep Learning

Innate-Values-driven Reinforcement Learning for Cooperative Multi-Agent Systems

Cooperative multi-agent game based on reinforcement learning

From mimic to counteract: a two-stage reinforcement learning algorithm for Google research football

Learning Multi-Agent Cooperation via Considering Actions of Teammates

On Stateful Value Factorization in Multi-Agent Reinforcement Learning

Relation-Aware Learning for Multi-Task Multi-Agent Cooperative Games

MO-MIX: Multi-Objective Multi-Agent Cooperative Decision-Making With Deep Reinforcement Learning

Boosting Value Decomposition Via Unit-Wise Attentive State Representation for Cooperative Multi-Agent Reinforcement Learning