Deep Reinforcement Learning Based Multi-UUV Cooperative Control for Target Capturing

Qianxin Xia,Zhong Wang,Zhiwen Wen,Weijun Cai
DOI: https://doi.org/10.1109/DASC/PiCom/CBDCom/Cy55231.2022.9927810
2022-09-12
Abstract:Cooperative target capturing is of significance in the field of autonomous coordination and swarm intelligence for underwater unmanned system. Existing target capturing strategies using multiple unmanned underwater vehicles (UUVs) usually lack adaptability and often fail in face of unknown and time-varying marine environment. By leveraging multi-agent deep reinforcement learning (MADDPG) techniques, this paper investigates an intelligent and cooperative target capturing scheme with better adaptability for UUV swarm in the dynamic adversarial environment. Specifically, we first present a dynamic UUV formation method for collaborative target capturing. Due to limitations of the distribution method by angular position, the desired capturing radius and angular spacing are selected as control indicators in the polar coordinate system. Then, by converting these two control indices into one in Cartesian coordinate, we devise an adaptive dynamic allocation strategy of interested round-up points. Finally, we combine both MADDPG and DDPG algorithms in the multi-agent particle environment (MPE) to perform adversarial training between the UUV formation and the targets to be captured. Extensive simulations demonstrate that UUVs can adaptively and cooperatively form a circular formation near the target without collision, and can successfully round up escaping targets by keeping them within the capturing circle.
Engineering,Environmental Science,Computer Science
What problem does this paper attempt to address?