Action Guidance-Based Deep Interactive Reinforcement Learning for AUV Path Planning

Dong Jiang,Zheng Fang,Chunxi Cheng,Bo He,Guangliang Li
DOI: https://doi.org/10.1109/mlcr57210.2022.00037
2022-01-01
Abstract:Autonomous underwater vehicle (AUV) is playing an increasingly important role in marine scientific research and resource exploration due to its autonomy and flexibility. As the core technology to improve AUV's autonomy, path planning facilitates AUV to complete its mission safely by avoiding obstacles in the route. In this paper, we proposed an action guidance-based interactive deep deterministic policy gradient (IDDPG) method for AUV path planning. The human trainer can provide a suggested action based on the real-time state of AUV and indirectly assign reward values by comparing the suggested action with the one selected by control policy. We tested IDDPG and compared to the original DDPG algorithm in two path planning tasks: obstacle boundary detour and local obstacle avoidance, on the Gazebo simulation platform. The simulation results show that IDDPG can alleviate low sample efficiency problem of traditional deep reinforcement learning and significantly improve the learning speed of AUV. In addition, IDDPG is shown to generalize better to untrained new environments than DDPG.
What problem does this paper attempt to address?