Abstract:As one of the commonly used vehicles for underwater detection, underwater robots are facing a series of problems. The real underwater environment is large-scale, complex, real-time and dynamic, and many unknown obstacles may exist in the underwater environment. Under such complex conditions and lack of prior knowledge, the existing path planning methods are difficult to plan, therefore they cannot effectively meet the actual demands. In response to these problems, a three-dimensional marine environment including multiple obstacles is established with the real ocean current data in this paper, which is consistent with the actual application scenarios. Then, we propose an N-step Priority Double DQN (NPDDQN) path planning algorithm, which potently realizes obstacle avoidance in the complex environment. In addition, this study proposes an experience screening mechanism, which screens the explored positive experience and improves its reuse rate, thus efficiently improving the algorithm stability in the dynamic environment. This paper verifies the better performance of reinforcement learning compared with a variety of traditional methods in three-dimensional underwater path planning. Underwater robots based on the proposed method have good autonomy and stability, which provides a new method for path planning of underwater robots. Note to Practitioners—The goal of this study is to provide a new solution for obstacle avoidance in path planning of underwater robots, which is consistent with the dynamic and real-time demands of the real environment. Existing underwater path planning researches lack a consistent environment with the actual application, and therefore we firstly construct a three-dimensional ocean environment with real ocean current data to provide support for the algorithms. Additionally, most of the algorithms are pre-planning methods or require long-time calculation, and there is little research on obstacle avoidance. In the face of obstacle changes, underwater robots with poor adaptability will cause performance decline and even economic losses. The proposed algorithm learns through interaction with the environment, and therefore it does not require any prior experience, and has good adaptability as well as fast inference speed. Especially, in the dynamic environment, algorithm performance is difficult to guarantee due to less positive experience in exploration. The proposed experience screening mechanism improves the stability of the algorithm, so that the underwater robot maintains stable performance in different dynamic environments.

Path Planning of Unmanned Surface Vehicle Based on Improved Q-Learning Algorithm

Learning and Sampling-Based Informative Path Planning for AUVs in Ocean Current Fields

A path planning approach for unmanned surface vehicles based on dynamic and fast Q-learning

Path Planning Algorithm for Unmanned Surface Vessel Based on Multiobjective Reinforcement Learning

A Hybrid Path Planning Algorithm for Unmanned Surface Vehicles in Complex Environment with Dynamic Obstacles.

A novel reinforcement learning based tuna swarm optimization algorithm for autonomous underwater vehicle path planning

An Algorithm of Complete Coverage Path Planning for Unmanned Surface Vehicle Based on Reinforcement Learning

Optimal Path Planning of Unmanned Surface Vehicle under Current Environment

Path Planning for Unmanned Surface Vehicles with Strong Generalization Ability Based on Improved Proximal Policy Optimization

An Optimal Control-Based Path Planning Method for Unmanned Surface Vehicles in Complex Environments

An Improved Quantum-Behaved Particle Swarm Optimization Algorithm Combined with Reinforcement Learning for AUV Path Planning

Global path planning algorithm based on double DQN for multi-tasks amphibious unmanned surface vehicle

Global Path Planning for Unmanned Surface Vehicle Based on Improved Quantum Ant Colony Algorithm

Collision Avoidance and Path Point Tracking Control for Underactuated Unmanned Surface Vehicles with Unknown Model Nonlinearity

Local Path Planning for Unmanned Surface Vehicle Collision Avoidance Based on Modified Quantum Particle Swarm Optimization

A path planning strategy unified with a COLREGS collision avoidance function based on deep reinforcement learning and artificial potential field

Unmanned Surface Vehicle Aided Maritime Data Collection Using Deep Reinforcement Learning

Intelligent Path Planning of Underwater Robot Based on Reinforcement Learning

Proximal policy optimization with reciprocal velocity obstacle based collision avoidance path planning for multi-unmanned surface vehicles

Using Deep Reinforcement Learning Methods for Autonomous Vessels in 2D Environments

Achieving optimal-dynamic path planning for unmanned surface vehicles: A rational multi-objective approach and a sensory-vector re-planner