Mpc-Based Model-Based Reinforcement Learning and Planning for Auv Path Following

Zheng Zhang,Tianhao Chen,Dong Jiang,Zheng Fang,Guangliang Li
DOI: https://doi.org/10.2139/ssrn.4349138
2023-01-01
Abstract:Autonomous underwater vehicle plays a crucial role in marine resource exploitation and marine scientific research due to its flexibility. Recently, model-free reinforcement learning has been introduced to improve the autonomy of AUV. However, model-free reinforcement learning is inefficient in sampling and time-consuming in training. In this paper, we proposed and implemented MPC-based model-based reinforcement learning (MB-MPC) for AUV path following. We tested our method in a trapezoidal following task on a Gazebo simulation underwater environment extended from Unmanned Underwater Vehicle Simulator by modelling our Sailfish 210. In addition, to show the generalization and stability of our method, we tested the trained policies in the original trapezoidal following task and a new triangular following task under the disturbance of small and large ocean currents, respectively. Our simulation results show that AUV path following via our method can obtain a better performance much faster than model-free reinforcement learning method. Moreover, AUV trained via MB-MPC method can generalize and adapt better to uncertainty and new tasks than model-free methods. In addition, planning via MPC plays an important role for overcoming the disturbance of ocean currents, especially at the point of turning.
What problem does this paper attempt to address?