AI-Olympics: Exploring the Generalization of Agents through Open Competitions

Chen Wang, Yan Song, Shuai Wu, Sa Wu, Ruizhi Zhang, Shu Lin, Haifeng Zhang
2024-05-23
Abstract: Between 2021 and 2023, AI-Olympics, a series of online AI competitions, was hosted by the online evaluation platform Jidi in collaboration with the IJCAI committee. In these competitions, an agent is required to accomplish diverse sports tasks in a two-dimensional continuous world while competing against an opponent. This paper provides a brief overview of the competition series and highlights notable findings. We aim to contribute insights to the field of multi-agent decision-making and explore the generalization of agents through engineering efforts.
Multiagent Systems
What problem does this paper attempt to address?
The paper addresses how well Reinforcement Learning (RL) agents generalize across different tasks. It studies multi-agent decision-making and generalization through the AI-Olympics competition series. The AI-Olympics environment is a 2D physics game engine built in Python that simulates various sports scenarios. Through a series of online competitions, it is used to evaluate agents under varying environments, tasks, and opponents. The core objectives of the paper include:

1. **Multi-task Generalization**: Exploring how agents can maintain good performance across diverse tasks, especially in environments they have not seen before.
2. **Engineering Practice**: Providing a testing platform through actual competitions to promote practical engineering exploration of agent generalization.
3. **Multi-agent Interaction**: Studying the interaction strategies and generalization of two agents with conflicting interests in a zero-sum game.

Through these competitions, researchers can evaluate the effectiveness of existing algorithms and also drive the development of new methods that improve the adaptability and generalization of agents in complex, dynamic environments.
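To make the idea of evaluating one agent across tasks and opponents concrete, here is a minimal sketch in Python. It does not use the AI-Olympics engine or the Jidi platform API. The task names, the skill dictionaries, and the functions `play_match`, `evaluate`, and `generalization_gap` are all hypothetical stand-ins, and the match outcome is a toy random draw, not a physics simulation. The sketch only illustrates the bookkeeping: play zero-sum matches on every (task, opponent) pair, then compare average payoff on tasks assumed to be seen in training against tasks assumed to be unseen.

```python
import random
from statistics import mean

# Hypothetical task labels; these are placeholders, not the competition's task list.
SEEN_TASKS = ["task_a", "task_b"]
UNSEEN_TASKS = ["task_c"]


def play_match(agent_skill, opponent_skill, task, rng):
    """Toy stand-in for one zero-sum match: +1 if the agent wins, -1 if it loses."""
    a, o = agent_skill[task], opponent_skill[task]
    p_win = a / (a + o)  # assumed win model, for illustration only
    return 1 if rng.random() < p_win else -1


def evaluate(agent_skill, opponents, tasks, episodes=200, seed=0):
    """Return the mean agent payoff for every (task, opponent) pair."""
    rng = random.Random(seed)
    table = {}
    for task in tasks:
        for name, opponent_skill in opponents.items():
            payoffs = [
                play_match(agent_skill, opponent_skill, task, rng)
                for _ in range(episodes)
            ]
            table[(task, name)] = mean(payoffs)
    return table


def generalization_gap(table, seen_tasks):
    """Mean payoff on seen tasks minus mean payoff on all other tasks."""
    seen = [v for (task, _), v in table.items() if task in seen_tasks]
    unseen = [v for (task, _), v in table.items() if task not in seen_tasks]
    return mean(seen) - mean(unseen)


# Hypothetical agent: strong on the seen tasks, weak on the unseen one.
agent = {"task_a": 3.0, "task_b": 3.0, "task_c": 1.0}
opponents = {
    "baseline": {"task_a": 1.0, "task_b": 1.0, "task_c": 3.0},
    "rule_based": {"task_a": 1.0, "task_b": 1.0, "task_c": 3.0},
}

table = evaluate(agent, opponents, SEEN_TASKS + UNSEEN_TASKS)
gap = generalization_gap(table, SEEN_TASKS)
```

A large positive `gap` would suggest that the agent's performance does not carry over to the unseen task. In a real evaluation, `play_match` would be replaced by rollouts in the actual environment against actual submitted opponents.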