Collaborative Encirclement of Multiple UAVs Based on Deep Reinforcement Learning

Weilai Jiang, Tianqing Cai, Chenghong Zheng, Yaonan Wang
DOI: https://doi.org/10.1109/ccdc62350.2024.10588163
2024-01-01
Abstract: Multi-UAV collaborative encirclement is an important component of multi-UAV collaborative tasks and has become a key growth area for new UAV combat capabilities. However, collaborative encirclement by multiple UAVs suffers from a low success rate, poor coordination, and an encirclement formation that is easily broken. To address these problems, this paper proposes a multi-head attention twin delayed deep deterministic policy gradient reinforcement learning algorithm. In this algorithm, the twin delayed deterministic policy gradient mitigates the excessive errors in UAV collaborative decision-making caused by the Critic network overestimating Q-values, and the multi-head attention mechanism improves the efficiency of UAV collaborative decision-making. In the experiments, to reflect the adversarial nature of both sides in a real encirclement, the escape target was trained with a deep deterministic policy gradient algorithm, so that it can autonomously adjust its strategy to evade the coordinated encirclement of the UAVs as far as possible. The experimental results show that UAVs trained by the proposed algorithm can efficiently complete collaborative capture tasks and that the algorithm has better convergence characteristics. Finally, the multi-head attention mechanism was introduced into the multi-agent deep deterministic policy gradient reinforcement learning algorithm, verifying that it is well suited to improving the performance of multi-agent deterministic policy reinforcement learning algorithms.
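The abstract attributes the reduced decision error to the twin delayed deterministic policy gradient, which is generally understood to counter Critic overestimation by taking the smaller of two target-critic estimates and by adding clipped noise to the target action. The following is a minimal sketch of that general idea for a single transition, not the authors' implementation. The function names and the hyperparameter values (`gamma`, `sigma`, `noise_clip`, `act_limit`) are assumptions chosen for illustration and do not come from the paper.

```python
import random


def td3_target(reward, done, q1_next, q2_next, gamma=0.99):
    """Bootstrapped target using the smaller of two target-critic values.

    Taking the minimum is what limits the Q-value overestimation that a
    single critic tends to accumulate. gamma is an assumed discount factor.
    """
    q_min = min(q1_next, q2_next)
    # No bootstrapping from the next state when the episode has ended.
    return reward + gamma * (1.0 - float(done)) * q_min


def smoothed_target_action(mu_next, rng, sigma=0.2, noise_clip=0.5, act_limit=1.0):
    """Target-policy smoothing: add clipped Gaussian noise to the target action.

    mu_next is the (scalar) action proposed by the target actor; sigma,
    noise_clip and act_limit are assumed values for this sketch.
    """
    noise = max(-noise_clip, min(noise_clip, rng.gauss(0.0, sigma)))
    return max(-act_limit, min(act_limit, mu_next + noise))


if __name__ == "__main__":
    rng = random.Random(0)
    print(td3_target(reward=1.0, done=False, q1_next=2.0, q2_next=3.0, gamma=0.5))
    print(smoothed_target_action(0.9, rng))
```

In a multi-UAV setting, each pursuer's critics would typically receive the joint observations and actions, with the attention mechanism weighting the other agents' contributions; that part is omitted here because the abstract does not specify the network structure.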