Multi-Agent Exploration Via Self-Learning and Social Learning

Shaokang Dong,Chao Li,Wubing Chen,Hongye Cao,Wenbin Li,Yang Gao
DOI: https://doi.org/10.1109/icassp48485.2024.10446068
2024-01-01
Abstract:Self-learning and social learning stand as two pivotal constituents in multi-agent exploration. Inspired by the fact that animals and humans explore unfamiliar environments to learn survival skills by training themselves using unlabeled data and replicating others' successful experiences, we propose a multi-agent reinforcement learning method, named Self-Learning and Social Learning (S 2 L), which aims to address the complex tasks caused by sparse rewards and intricate sequential structures. Specifically, in Self-Learning, we incorporate both task-specific and task-agnostic intrinsic rewards. These incentives steer individual agents towards exploration and comprehension of the environment. Furthermore, in Social Learning, different independent agents can implicitly share the successful experience by observing others in view and without additional communication or parameter-sharing overhead. Finally, experimental evaluation of S 2 L on the complex task characterized by sparse rewards and intricate sequential structures demonstrates its superior performance against other competing exploration baselines.
What problem does this paper attempt to address?