Abstract: Underwater robots are among the most commonly used vehicles for underwater detection, yet they face a series of problems. The real underwater environment is large-scale, complex, and dynamic, it demands real-time response, and it may contain many unknown obstacles. Under such complex conditions and without prior knowledge, existing path planning methods struggle to produce feasible plans and therefore cannot effectively meet practical demands. In response, this paper establishes a three-dimensional marine environment containing multiple obstacles and built from real ocean current data, consistent with actual application scenarios. We then propose an N-step Priority Double DQN (NPDDQN) path planning algorithm, which effectively achieves obstacle avoidance in this complex environment. In addition, we propose an experience screening mechanism that screens the positive experiences found during exploration and increases their reuse rate, thereby improving the algorithm's stability in the dynamic environment. The paper verifies that reinforcement learning outperforms a variety of traditional methods in three-dimensional underwater path planning. Underwater robots based on the proposed method show good autonomy and stability, offering a new approach to underwater robot path planning.

Note to Practitioners: The goal of this study is to provide a new solution for obstacle avoidance in underwater robot path planning that meets the dynamic and real-time demands of the real environment. Existing underwater path planning research lacks an environment consistent with actual applications, so we first construct a three-dimensional ocean environment with real ocean current data to support the algorithms. Additionally, most existing algorithms are pre-planning methods or require long computation times, and there is little research on obstacle avoidance. When obstacles change, underwater robots with poor adaptability suffer performance decline and may even cause economic losses. The proposed algorithm learns through interaction with the environment, so it requires no prior experience and offers good adaptability as well as fast inference. In dynamic environments in particular, algorithm performance is difficult to guarantee because exploration yields few positive experiences. The proposed experience screening mechanism improves the stability of the algorithm, so that the underwater robot maintains stable performance across different dynamic environments.
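
The abstract names the NPDDQN algorithm but does not give its update rule. As a rough illustration of the three ingredients the name suggests (N-step returns, prioritized replay, and Double DQN), the Python sketch below computes a textbook N-step Double DQN target and a proportional replay priority for a single transition. It is not the authors' implementation. All function names, the discount factor, and the `alpha` and `eps` values are assumptions chosen for illustration.

```python
from typing import Sequence


def n_step_return(rewards: Sequence[float], gamma: float) -> float:
    """Discounted sum of the n rewards collected after the starting state."""
    total = 0.0
    for k, r in enumerate(rewards):
        total += (gamma ** k) * r
    return total


def n_step_double_dqn_target(
    rewards: Sequence[float],
    q_online_next: Sequence[float],
    q_target_next: Sequence[float],
    gamma: float,
    done: bool,
) -> float:
    """Textbook N-step Double DQN target for one transition (illustrative).

    rewards        -- the n rewards r_t ... r_{t+n-1}
    q_online_next  -- online-network Q-values at s_{t+n}, one per action
    q_target_next  -- target-network Q-values at s_{t+n}, one per action
    done           -- True if the episode ended within the n steps
    """
    g = n_step_return(rewards, gamma)
    if done:
        # No bootstrapping past a terminal state.
        return g
    # Double DQN: the online network selects the action,
    # the target network evaluates it.
    best_action = max(range(len(q_online_next)), key=lambda a: q_online_next[a])
    return g + (gamma ** len(rewards)) * q_target_next[best_action]


def priority(td_error: float, alpha: float = 0.6, eps: float = 1e-6) -> float:
    """Proportional replay priority; alpha and eps are assumed values."""
    return (abs(td_error) + eps) ** alpha


# Hypothetical numbers: three rewards, three candidate actions.
target = n_step_double_dqn_target(
    rewards=[1.0, 0.0, 2.0],
    q_online_next=[0.1, 0.9, 0.3],
    q_target_next=[4.0, 8.0, 6.0],
    gamma=0.5,
    done=False,
)
print(target)
```

In this sketch, a transition with a larger temporal-difference error receives a larger priority and would be sampled more often from the replay buffer. The experience screening mechanism described in the abstract is a separate component and is not modeled here.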
