Abstract:This work proposes an innovative path-following control method, anchored in deep reinforcement learning (DRL), for unmanned underwater vehicles (UUVs). This approach is driven by several new designs, all of which aim to enhance learning efficiency and effectiveness and achieve high-performance UUV control. Specifically, a novel experience replay strategy is designed and integrated within the twin-delayed deep deterministic policy gradient algorithm (TD3). It distinguishes the significance of stored transitions by making a trade-off between rewards and temporal-difference (TD) errors, thus enabling the UUV agent to explore optimal control policies more efficiently. Another major challenge within this control problem arises from action oscillations associated with DRL policies. This issue leads to excessive system wear on actuators and makes real-time application difficult. To mitigate this challenge, a newly improved regularization method is proposed, which provides a moderate level of smoothness to the control policy. Furthermore, a dynamic reward function featuring adaptive constraints is designed to avoid unproductive exploration and expedite learning convergence speed further. Simulation results show that our method garners higher rewards in fewer training episodes compared with mainstream DRL-based control approaches (e.g., deep deterministic policy gradient (DDPG) and vanilla TD3) in UUV applications. Moreover, it can adapt to varying path configurations amid uncertainties and disturbances, all while ensuring high tracking accuracy. Simulation and experimental studies are conducted to verify the effectiveness.

Deep Interactive Reinforcement Learning for Path Following of Autonomous Underwater Vehicle

Continuous Control for Autonomous Underwater Vehicle Path Following Using Deep Interactive Reinforcement Learning

Neural Network Model-Based Reinforcement Learning Control for AUV 3-D Path Following

Path planning of autonomous underwater vehicle in unknown environment based on improved deep reinforcement learning

Generative adversarial interactive imitation learning for path following of autonomous underwater vehicle

A Path Planning Method Based on Deep Reinforcement Learning for AUV in Complex Marine Environment

Action Guidance-Based Deep Interactive Reinforcement Learning for AUV Path Planning

AUV Path Following Control using Deep Reinforcement Learning Under the Influence of Ocean Currents.

Path Planning based on Deep Reinforcement Learning for Autonomous Underwater Vehicles under Ocean Current Disturbance

Path-Following Control of Unmanned Underwater Vehicle Based on an Improved TD3 Deep Reinforcement Learning

Three-Dimensional Path-Following Control of an Autonomous Underwater Vehicle Based on Deep Reinforcement Learning

Target Search Control Of Auv In Underwater Environment With Deep Reinforcement Learning

Underwater Multi-agent Cooperative Formation Hunting Based on Deep Reinforcement Learning

An Information-Assisted Deep Reinforcement Learning Path Planning Scheme for Dynamic and Unknown Underwater Environment

Imitation Learning from Imperfect Demonstrations for AUV Path Tracking and Obstacle Avoidance

Deep Reinforcement Learning with Model Predictive Control for Path Following of Autonomous Underwater Vehicle

Deep reinforcement learning for adaptive path planning and control of an autonomous underwater vehicle

Deep Reinforcement Learning for Vectored Thruster Autonomous Underwater Vehicle Control

Deep Reinforcement Learning Based Optimal Trajectory Tracking Control of Autonomous Underwater Vehicle

Target following for an autonomous underwater vehicle using regularized ELM-based reinforcement learning

AUV Obstacle Avoidance Framework Based on Event-Triggered Reinforcement Learning