Abstract:This research aims to solve the safe navigation problem of autonomous underwater vehicles (AUVs) in deep ocean, which is a complex and changeable environment with various mountains. When an AUV reaches the deep sea navigation, it encounters many underwater canyons, and the hard valley walls threaten its safety seriously. To solve the problem on the safe driving of AUV in underwater canyons and address the potential of AUV autonomous obstacle avoidance in uncertain environments, an improved AUV path planning algorithm based on the deep deterministic policy gradient (DDPG) algorithm is proposed in this work. This method refers to an end-to-end path planning algorithm that optimizes the strategy directly. It takes sensor information as input and driving speed and yaw angle as outputs. The path planning algorithm can reach the predetermined target point while avoiding large-scale static obstacles, such as valley walls in the simulated underwater canyon environment, as well as sudden small-scale dynamic obstacles, such as marine life and other vehicles. In addition, this research aims at the multi-objective structure of the obstacle avoidance of path planning, modularized reward function design, and combined artificial potential field method to set continuous rewards. This research also proposes a new algorithm called deep SumTree-deterministic policy gradient algorithm (SumTree-DDPG), which improves the random storage and extraction strategy of DDPG algorithm experience samples. According to the importance of the experience samples, the samples are classified and stored in combination with the SumTree structure, high-quality samples are extracted continuously, and SumTree-DDPG algorithm finally improves the speed of the convergence model. Finally, this research uses Python language to write an underwater canyon simulation environment and builds a deep reinforcement learning simulation platform on a high-performance computer to conduct simulation learning training for AUV. Data simulation verified that the proposed path planning method can guide the under-actuated underwater robot to navigate to the target without colliding with any obstacles. In comparison with the DDPG algorithm, the stability, training's total reward, and robustness of the improved Sumtree-DDPG algorithm planner in this study are better.

AUV Collision Avoidance Planning Method Based on Deep Deterministic Policy Gradient

Research on Method of Collision Avoidance Planning for UUV Based on Deep Reinforcement Learning

AUV Obstacle Avoidance Planning Based on Deep Reinforcement Learning

A 2d Optimal Path Planning Algorithm For Autonomous Underwater Vehicle Driving In Unknown Underwater Canyons

DRL-based Path Planning and Obstacle Avoidance of Autonomous Underwater Vehicle

Research on collision avoidance algorithm of unmanned surface vehicle based on deep reinforcement learning

Design and Field Test of Collision Avoidance Method with Prediction for USVs: A Deep Deterministic Policy Gradient Approach

Action Guidance-Based Deep Interactive Reinforcement Learning for AUV Path Planning

Communication-Aware Motion Planning of AUV in Obstacle-Dense Environment: A Binocular Vision-Based Deep Learning Method.

Deep Reinforcement Learning Approach with Multiple Experience Pools for UAV's Autonomous Motion Planning in Complex Unknown Environments

Research on Obstacle Avoidance Planning for UUV Based on A3C Algorithm

Real-time Planning and Collision Avoidance Control Method Based on Deep Reinforcement Learning

A Learning Method for AUV Collision Avoidance Through Deep Reinforcement Learning

Reinforcement Learning Based Obstacle Avoidance for AUV Swarm in Dynamic Ocean Environment

A path planning strategy unified with a COLREGS collision avoidance function based on deep reinforcement learning and artificial potential field

Proximal policy optimization with reciprocal velocity obstacle based collision avoidance path planning for multi-unmanned surface vehicles

Research and Design of an Autonomous Underwater Vehicle Path Planning Method Based on Deep Reinforcement Learning

An Improved Quantum-Behaved Particle Swarm Optimization Algorithm Combined with Reinforcement Learning for AUV Path Planning

A Balanced Collision Avoidance Algorithm for USVs in Complex Environment: A Deep Reinforcement Learning Approach

AUV Collision Avoidance Strategy Based on Fuzzy Reinforcement Learning

Path Planning of Unmanned Underwater Vehicles Based on Deep Reinforcement Learning Algorithm