Cross-Domain Communications Between Agents Via Adversarial-Based Domain Adaptation in Reinforcement Learning

Lichao Meng,Jingjing Li,Ke Lu
DOI: https://doi.org/10.1109/icc45855.2022.9838583
2022-01-01
Abstract:Reinforcement learning is suitable for solving sequential decision-making problems, and deep reinforcement learning methods have shown excellent performance in many fields. However, agents often face the challenge of a large number of interactions with the environment, which means that it is unrealistic to train agents from scratch in each new domain. In order to overcome this problem, our paper introduces the cross-domain communications between RL agents, so that agents in new domains can receive and use the information sent by agents trained in related domains to assist decision-making. Specifically, this paper uses adversarial-based domain adaptation methods and multi-granular loss constraints to realize implicit communications between cross-domain agents and encourage agents in different domains to extract domain-invariant information for communication and sharing, thereby the optimal behavior policy of the agent training based on the shared information in the source domain can be transferred to the related domain and achieve the expected performance. Finally, we evaluate our method on various variants of Car-Racing games, and the results show that this method can achieve efficient information communications between cross-domain agents and better performance than previous methods.
What problem does this paper attempt to address?