Centralized control for multi-agent RL in a complex Real-Time-Strategy game

Roger Creus Castanyer
2023-04-26
Abstract:Multi-agent Reinforcement learning (MARL) studies the behaviour of multiple learning agents that coexist in a shared environment. MARL is more challenging than single-agent RL because it involves more complex learning dynamics: the observations and rewards of each agent are functions of all other agents. In the context of MARL, Real-Time Strategy (RTS) games represent very challenging environments where multiple players interact simultaneously and control many units of different natures all at once. In fact, RTS games are so challenging for the current RL methods, that just being able to tackle them with RL is interesting. This project provides the end-to-end experience of applying RL in the Lux AI v2 Kaggle competition, where competitors design agents to control variable-sized fleets of units and tackle a multi-variable optimization, resource gathering, and allocation problem in a 1v1 scenario against other competitors. We use a centralized approach for training the RL agents, and report multiple design decisions along the process. We provide the source code of the project: <a class="link-external link-https" href="https://github.com/roger-creus/centralized-control-lux" rel="external noopener nofollow">this https URL</a>.
Artificial Intelligence,Multiagent Systems
What problem does this paper attempt to address?
The paper focuses on addressing the challenges involved in applying Reinforcement Learning (RL) techniques, specifically Multi-Agent Reinforcement Learning (MARL), to a complex Real-Time Strategy (RTS) game environment named Lux AI v2. The primary goal is to develop a centralized control scheme for training RL agents in this environment. Here's a summary of the problems the paper aims to solve: 1. **Coordination and Control in Multi-Agent Settings**: The paper addresses the coordination and control of multiple agents in a shared environment, which is inherently more complex than single-agent RL due to the dynamic interplay between agents. 2. **Complexity of RTS Games**: RTS games like Lux AI v2 pose significant challenges for RL methods due to their inherent complexity, including: - **Large State and Action Spaces**: The environment features a vast number of possible states and actions. - **Non-Stationary Dynamics**: The environment is non-stationary, meaning the underlying dynamics change over time. - **Sparse Rewards**: Rewards are often sparse, making it difficult for agents to learn meaningful policies. 3. **Engineering Challenges in Lux AI v2**: Specific engineering challenges in the Lux AI v2 environment include: - **Preprocessing Observations**: The raw observations need to be preprocessed before being fed into the RL agent. - **Variable-Sized Fleets**: Players must manage variable-sized fleets of units, issuing actions at each turn. - **Different