Neural-network-based Deterministic Policy Gradient for Depth Control of AUVs

Hui Wu,Shiji Song,Keyou You,Cheng Wu
DOI: https://doi.org/10.1109/cac.2017.8242882
2017-01-01
Abstract:This paper considers the depth control problem of autonomous underwater vehicles (AUVs) in discrete time. A neural-network-based deterministic policy gradient (NNDPG) controller is proposed by combining the deterministic policy gradient theorem with neural networks. Two networks, evaluation network and policy network, are designed to respectively approximate the long-term cost function and policy function. Several heuristic strategies, including prioritized experience replay and target networks, are incorporated into the algorithm to improve the robustness of convergence. Simulation results on two different AUV models are presented to demonstrate the effectiveness of the proposed method.
What problem does this paper attempt to address?