Abstract:The autonomous navigation and obstacle avoidance capabilities of autonomous underwater vehicles (AUVs) are essential for ensuring their safe navigation and long-term, efficient operation. However, the complexity of the marine environment poses significant challenges to safe and effective obstacle avoidance. To address this issue, this study proposes an AUV obstacle avoidance control algorithm based on offline reinforcement learning. This method adopts the Conservative Q-learning (CQL) algorithm, which is based on the Soft Actor-Critic (SAC) framework. It learns from obtained historical obstacle avoidance data and ultimately achieves a favorable obstacle avoidance control strategy. In this method, PID and SAC control algorithms are utilized to generate expert obstacle avoidance data to construct a diversified offline database. Additionally, based on the line-of-sight (LOS) guidance method and artificial potential field (APF) method, information regarding the distance and orientation of targets and obstacles is incorporated into the state space, and heading and obstacle avoidance reward terms are integrated into the reward function design. The algorithm successfully guides the AUV in autonomous navigation and dynamic obstacle avoidance in three-dimensional space. Furthermore, the algorithm exhibits a certain degree of anti-interference capability against uncertain disturbances and ocean currents, enhancing the safety and robustness of the AUV system. Simulation results fully demonstrate the feasibility and effectiveness of the intelligent obstacle avoidance method based on offline reinforcement learning. This study highlights the profound significance of offline reinforcement learning in enabling robust and reliable control systems for AUVs, paving the way for enhanced operational capabilities in challenging marine environments.

Intelligent AUV Surfacing Control in Network Attack Scenario

Learning an End-To-End Policy for AUV Control Within Just Forty Minutes Using Parallel Simulation

Neural Network Model-Based Reinforcement Learning Control for AUV 3-D Path Following

AUV Obstacle Avoidance Framework Based on Event-Triggered Reinforcement Learning

A Learning Method for AUV Collision Avoidance Through Deep Reinforcement Learning

Sim-to-Real Transfer of Adaptive Control Parameters for AUV Stabilization under Current Disturbance

Deep reinforcement learning for adaptive path planning and control of an autonomous underwater vehicle

AUV position tracking and trajectory control based on fast-deployed deep reinforcement learning method

Deep Interactive Reinforcement Learning for Path Following of Autonomous Underwater Vehicle

AUV Path Following Control using Deep Reinforcement Learning Under the Influence of Ocean Currents.

Path-Following Control of Unmanned Underwater Vehicle Based on an Improved TD3 Deep Reinforcement Learning

Sim-to-real transfer of adaptive control parameters for AUV stabilisation under current disturbance

Secure and Cooperative Target Tracking Via AUV Swarm - A Reinforcement Learning Approach.

Reinforcement Learning Based Obstacle Avoidance for Autonomous Underwater Vehicle

A Fast Adaptive AUV Control Policy Based on Progressive Networks with Context Information

Research and Design of an Autonomous Underwater Vehicle Path Planning Method Based on Deep Reinforcement Learning

Reinforcement Learning Based Obstacle Avoidance for AUV Swarm in Dynamic Ocean Environment

Path planning of autonomous underwater vehicle in unknown environment based on improved deep reinforcement learning

Research on obstacle avoidance of underactuated autonomous underwater vehicle based on offline reinforcement learning

Research on collision avoidance algorithm of unmanned surface vehicle based on deep reinforcement learning

Deep Reinforcement Learning Controller for 3D Path Following and Collision Avoidance by Autonomous Underwater Vehicles