AUV path following controlled by modified Deep Deterministic Policy Gradient

Yushan Sun, Xiangrui Ran, Guocheng Zhang, Xiangbin Wang, Hao Xu
DOI: https://doi.org/10.1016/j.oceaneng.2020.107360
IF: 5
2020-08-01
Ocean Engineering
Abstract:<p>This study proposes a Deep Deterministic Policy Gradient algorithm based on optimized sample pools and an average motion critic network (OSAM-DDPG) to realize path following control of autonomous underwater vehicles (AUVs). Optimizing the sampling mode and evaluating motion are proposed to improve the efficiency of the algorithm. OSAM-DDPG is used to train the force-to-state mapping of an AUV's dynamical model to realize its control. In the simulation test, the OSAM-DDPG algorithm needs only some episodes to obtain the complete control strategy. Based on the experience gained from training, path following problems for various paths in an environment with interference can be addressed, and the results demonstrate that path following control based on OSAM-DDPG performs better than S-plane control.</p>
engineering, civil, ocean, marine, oceanography
What problem does this paper attempt to address?