TIMAT: Temporal Information Multi-Agent Transformer.

Qitong Kang,Fuyong Wang,Zhongxin Liu,Zengqiang Chen
DOI: https://doi.org/10.5555/3635637.3663147
2024-01-01
Abstract:In many specific tasks, training models with Multi-Agent Reinforcement Learning (MARL) to solve a task often leads to overfitting to the training environment. When dealing with multi-task, models specialized for a single task often fail to generalize, and retraining models often implies the consumption of computational resources. Therefore, it is necessary to establish a pre-trained model that can be quickly deployed in an online environment. Therefore, we propose temporal information multi-agent transformer (TIMAT) based on the transformer that extracts temporal information and models MARL as Sequence Models (SM). The advantage of this framework is that it can handle time information of arbitrary length and any number of agents regardless of the type, which greatly enhances the generalization ability of the model.
What problem does this paper attempt to address?