Graph-Based Multi-agent Reinforcement Learning for Large-Scale UAVs Swarm System Control

Bocheng Zhao,Mingying Huo,Zheng Li,Ze Yu,Naiming Qi
DOI: https://doi.org/10.1016/j.ast.2024.109166
IF: 5.457
2024-04-28
Aerospace Science and Technology
Abstract:In this study, a novel graph-embedding technique based on a graph neural network (GNN) is proposed to identify the topology in the motion of a unmanned aerial vehicles (UAV) swarm and quickly obtain local information around each agent. We also propose a model reference reinforcement learning method to learn the potential field function and determine an appropriate strategy for each agent that can satisfy the requirements of collaborative motion and obstacle avoidance for large-scale UAV swarms. First, a new swarm structure is proposed to provide reserved maneuvering space for UAVs during flight. In addition, a method was proposed to encode the obstacle avoidance behavior of multiple UAVs in a continuous space into spatial maps. A graph attention mechanism (GAT) structure based on local information was proposed to obtain dynamic graph information, and each individual output action was obtained according to the current state information. To improve the training effect, this method can restrain the UAV group while maintaining the formation and preventing collisions among the UAV. Second, a new distributed control algorithm based on multi-agent reinforcement learning (MARL) is proposed by learning the potential field function using local information obtained by a GNN. Each individual can repel and cooperate with the target within a short range and attract objects over a long distance. Finally, simulation results demonstrate the effectiveness and superiority of the proposed method, which has great potential for application in online autonomous collaboration.
engineering, aerospace
What problem does this paper attempt to address?