Abstract: Reinforcement learning (RL) has shown superior performance in solving sequential decision problems. In recent years, RL has increasingly been applied to collision avoidance decision-making for unmanned driving in complex scenarios. However, ships encounter a wide variety of scenarios, and the differences between them seriously hinder the application of RL to collision avoidance at sea. Moreover, trial-and-error learning in multi-ship encounter scenarios iterates slowly. To address these problems, this study develops a novel intelligent collision avoidance algorithm based on approximate representation reinforcement learning (AR-RL), which enables maritime autonomous surface ships (MASS) to avoid collisions in a continuous state space environment with an interactive learning capability resembling that of a crew in a navigation situation. The new algorithm uses an approximate representation model to optimize collision avoidance strategies in dynamic target encounter situations. The model is combined with prior knowledge and the International Regulations for Preventing Collisions at Sea (COLREGs) for optimal performance. An online solution to the value function approximation model is then designed based on gradient descent. This approach can solve the problem of large-scale collision avoidance policy learning in environments that mix static and dynamic obstacles. Finally, the algorithm was tested in two scenarios (i.e., the coastal static obstacle environment and the mixed static-dynamic obstacle environment) using Tianjin Port as an example and compared with multiple groups of algorithms. The results show that, through approximate representation, the algorithm improves large-scale learning efficiency in the continuous state space of a dynamic obstacle environment. At the same time, the MASS can efficiently and safely avoid obstacles en route to its target destination. The study therefore contributes significantly to ensuring safety at sea in mixed traffic involving both manned ships and MASS in the near future.
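
The online, gradient-descent-based value function approximation mentioned in the abstract can be illustrated with a generic semi-gradient Q-learning step on a linear model. This is a minimal sketch under stated assumptions, not the AR-RL model itself: the feature map `features`, the two-element state layout, the reward value, and the hyperparameters `alpha` and `gamma` are all hypothetical and are not taken from the paper.

```python
import numpy as np

def features(state, action, n_actions):
    """Hypothetical feature map: copy the continuous state (plus a bias)
    into the block of the weight vector that belongs to `action`."""
    state = np.asarray(state, dtype=float)
    block = state.size + 1
    phi = np.zeros(n_actions * block)
    start = action * block
    phi[start:start + state.size] = state
    phi[start + state.size] = 1.0  # bias term
    return phi

def q_value(w, state, action, n_actions):
    """Linear approximation Q(s, a) = w . phi(s, a)."""
    return float(w @ features(state, action, n_actions))

def td_update(w, state, action, reward, next_state, done,
              n_actions, alpha=0.1, gamma=0.95):
    """One online semi-gradient Q-learning step.
    Returns the updated weights and the TD error."""
    q_sa = q_value(w, state, action, n_actions)
    if done:
        target = reward
    else:
        target = reward + gamma * max(
            q_value(w, next_state, a, n_actions) for a in range(n_actions)
        )
    delta = target - q_sa
    # For a linear model the gradient of Q with respect to w is phi(s, a).
    w_new = w + alpha * delta * features(state, action, n_actions)
    return w_new, delta

# Assumed toy transition: state = (relative distance, relative bearing).
n_actions = 2
w0 = np.zeros(n_actions * 3)
w1, delta1 = td_update(w0, [1.0, 0.5], 1, -1.0, [0.9, 0.4], False, n_actions)
w2, delta2 = td_update(w1, [1.0, 0.5], 1, -1.0, [0.9, 0.4], False, n_actions)
```

Because each update uses only the current transition, the weights can be adjusted while the ship interacts with the environment, and the magnitude of the TD error shrinks when the same transition is replayed. A linear model is used here only to keep the gradient explicit; any differentiable approximator could take its place.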

COLREGs-constrained Safe Reinforcement Learning for Realising MASS's Risk-Informed Collision Avoidance Decision Making

Optimizing Anti-Collision Strategy for MASS: A Safe Reinforcement Learning Approach to Improve Maritime Traffic Safety

Collision avoidance for autonomous ship using deep reinforcement learning and prior-knowledge-based approximate representation

Decision-Making for the Autonomous Navigation of Maritime Autonomous Surface Ships Based on Scene Division and Deep Reinforcement Learning

A COLREGs-Compliant Deep Reinforcement Learning Approach

Knowledge transfer enabled reinforcement learning for efficient and safe autonomous ship collision avoidance

A human-like collision avoidance method for autonomous ship with attention-based deep reinforcement learning

Navigation Behavioural Decision-Making of MASS Based on Deep Reinforcement Learning and Artificial Potential Field

Path Planning of Maritime Autonomous Surface Ships in Unknown Environment with Reinforcement Learning

An Improved NSGA-II Algorithm for MASS Autonomous Collision Avoidance under COLREGs

Risk-based implementation of COLREGs for autonomous surface vehicles using deep reinforcement learning

Collision avoidance decision-making strategy for multiple USVs based on Deep Reinforcement Learning algorithm

Adaptive Collision Avoidance Decisions in Autonomous Ship Encounter Scenarios Through Rule-Guided Vision Supervised Learning

Deep Reinforcement Learning Based Path Planning and Collision Avoidance for Smart Ships in Complex Environments

Navigation Situation Adaptive Learning-Based Path Planning of Maritime Autonomous Surface Ships

Autonomous navigation of marine surface vessel in extreme encounter situation

A Novel Reinforcement Learning Collision Avoidance Algorithm for USVs Based on Maneuvering Characteristics and COLREGs

Efficient Reinforcement Learning for Autonomous Ship Collision Avoidance under Learning Experience Reuse

Deep reinforcement learning with dynamic window approach based collision avoidance path planning for maritime autonomous surface ships

A Novel Deep Reinforcement Learning for POMDP-based Autonomous Ship Collision Decision-Making

A safe reinforcement learning approach for autonomous navigation of mobile robots in dynamic environments