Graph Neural Network-based Multi-agent Reinforcement Learning for Resilient Distributed Coordination of Multi-Robot Systems

Anthony Goeckner, Yueyuan Sui, Nicolas Martinet, Xinliang Li, Qi Zhu
2024-03-20
Abstract: Existing multi-agent coordination techniques are often fragile and vulnerable to anomalies such as agent attrition and communication disturbances, which are quite common in the real-world deployment of systems like field robotics. To better prepare these systems for the real world, we present a graph neural network (GNN)-based multi-agent reinforcement learning (MARL) method for resilient distributed coordination of a multi-robot system. Our method, Multi-Agent Graph Embedding-based Coordination (MAGEC), is trained using multi-agent proximal policy optimization (PPO) and enables distributed coordination around global objectives under agent attrition, partial observability, and limited or disturbed communications. We use a multi-robot patrolling scenario to demonstrate our MAGEC method in a ROS 2-based simulator and then compare its performance with prior coordination approaches. Results demonstrate that MAGEC outperforms existing methods in several experiments involving agent attrition and communication disturbance, and provides competitive results in scenarios without such anomalies.
Multiagent Systems, Robotics
What problem does this paper attempt to address?
This paper proposes Multi-Agent Graph Embedding-based Coordination (MAGEC), a multi-agent reinforcement learning (MARL) method built on graph neural networks (GNNs) that aims to make distributed coordination of multi-robot systems more resilient. Existing multi-agent coordination techniques are often fragile under anomalies such as agent attrition and communication disturbances. MAGEC is trained with multi-agent Proximal Policy Optimization (PPO) and enables distributed coordination around global objectives even under agent attrition, partial observability, and limited or disturbed communication. At its core, MAGEC uses a GNN to process local information about the environment, learning graph embeddings through message passing that capture the attributes of nodes and edges as well as the relationships among them. The approach targets the coordination challenges that arise under agent attrition and communication problems, especially in real-world environments with frequent interference, such as battlefields. The paper validates MAGEC in simulation on a multi-robot patrolling scenario and compares it with existing coordination methods. The results show that MAGEC outperforms existing methods in several experiments involving agent attrition and communication disturbance, and remains competitive in scenarios without such anomalies. The paper also discusses related work, including GNNs in multi-agent reinforcement learning and multi-agent patrolling algorithms, and describes the problem definition, design approach, training environment, and evaluation methods in detail. Overall, MAGEC offers a disturbance-resilient multi-agent coordination strategy intended to improve the practicality and adaptability of multi-robot systems.
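To make the message-passing idea concrete, the following is a minimal sketch of one generic round of mean-aggregation message passing over a small graph with node and edge features. It is not MAGEC's architecture: the function names (`matvec`, `message_passing_step`), the weight matrices, the `tanh` activation, and the toy three-node graph are all assumptions chosen for illustration, and the weights are fixed by hand rather than learned.

```python
import math

def matvec(vec, weights):
    """Multiply a length-F vector by an F x H weight matrix (list of rows)."""
    return [sum(v * row[j] for v, row in zip(vec, weights))
            for j in range(len(weights[0]))]

def message_passing_step(node_feats, edges, edge_feats, w_self, w_msg):
    """One round of mean-aggregation message passing (illustrative only).

    node_feats: list of N feature vectors (length F)
    edges:      list of directed (src, dst) index pairs
    edge_feats: one feature vector (length D) per edge
    w_self:     F x H weights applied to a node's own features
    w_msg:      (F + D) x H weights applied to [sender features, edge features]
    """
    num_nodes = len(node_feats)
    hidden = len(w_self[0])
    agg = [[0.0] * hidden for _ in range(num_nodes)]
    counts = [0] * num_nodes
    for (src, dst), e in zip(edges, edge_feats):
        # A message combines the sender's features with the edge's features.
        msg = matvec(list(node_feats[src]) + list(e), w_msg)
        agg[dst] = [a + m for a, m in zip(agg[dst], msg)]
        counts[dst] += 1
    out = []
    for i in range(num_nodes):
        own = matvec(node_feats[i], w_self)
        n = max(counts[i], 1)  # nodes with no incoming messages keep agg = 0
        out.append([math.tanh(o + a / n) for o, a in zip(own, agg[i])])
    return out

# Hypothetical toy graph: three nodes in a line, bidirectional edges.
node_feats = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
edges = [(0, 1), (1, 0), (1, 2), (2, 1)]
edge_feats = [[0.5]] * len(edges)
w_self = [[1.0, 0.0], [0.0, 1.0]]
w_msg = [[0.1, 0.0], [0.0, 0.1], [0.2, 0.2]]

embeddings = message_passing_step(node_feats, edges, edge_feats, w_self, w_msg)

# With every edge removed (a crude stand-in for lost communication),
# each embedding depends only on that node's own features.
isolated = message_passing_step(node_feats, [], [], w_self, w_msg)
```

Because each node's embedding is computed only from its own features and whatever messages actually arrive, the same function still runs when edges or neighbours disappear, which is one intuition for why graph-based representations are a natural fit for settings with attrition and disturbed communication.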