Multi-Agent Reinforcement Learning for Unmanned Surface Vehicle Hunting Target

Ning Zhao,Zhe Liu,Shan Xue,Weidong Zhang
DOI: https://doi.org/10.1109/aiea62095.2024.10692565
2024-01-01
Abstract:A multi-agent reinforcement learning-based hunting algorithm is proposed for the problem of unmanned surface vehicle (USV) target hunting. First, the kinematic and dynamic models of the USV are constructed, and the conditions for successful target hunting are provided. Further, the cooperative hunting problem involving multiple USVs is modeled as a Markov decision process (MDP), and the state space, action space, and reward function are designed. In addition, a framework for a multi-agent reinforcement learning network and a process for a multi-agent deterministic policy gradient (MADDPG) algorithm are established. Finally, a centralized training and decentralized execution framework is used to complete the training of hunting strategies, where each USV in the cluster shares the same approach and executes actions independently. The proposed method is applicable to hunting scenarios with a continuous action space and a high-dimensional state space. Simulation experiments confirm the effectiveness of the designed reward function and the MADDPG algorithm in multi-USV hunting target scenarios.
What problem does this paper attempt to address?