Auv 3d Docking Control Using Deep Reinforcement Learning

Tianze Zhang,Xuhong Miao,Yibin Li,Lei Jia,Zheng Wei,Qingtao Gong,Tao Wen
DOI: https://doi.org/10.1016/j.oceaneng.2023.115021
IF: 5
2023-01-01
Ocean Engineering
Abstract:Autonomous docking can enable AUV to have long endurance, so it is necessary to consider the issue of robust docking control under current and wave disturbances. In this work, based on the proximal policy optimization (PPO) algorithm, we developed a model-free docking controller to complete three-dimensional docking tasks under disturbances. To improve the performance of PPO, two mechanisms are proposed, including adaptive rollback clipping and self-generated demonstration replay. A simulation environment is constructed, including fuzzy hydrodynamic parameters, ocean current and wave disturbance model. Simulation results demonstrate that our proposed method has faster learning speed, higher robustness, and can control AUV to achieve 3D docking tasks in complex environments with a high success rate.
What problem does this paper attempt to address?