Abstract: Research on decision-making models for ship collision avoidance faces several challenges. These include inadequate treatment of complex factors: a focus on open-water scenarios, the neglect of static obstacles, and insufficient attention to collision avoidance between manned ships and maritime autonomous surface ships (MASSs). To overcome these limitations, a decision model for MASS collision avoidance is proposed that combines the strengths of model-based and model-free reinforcement learning. The model integrates S-57 chart information, AIS data, and the Dyna framework. (1) When the MASS's navigation task is known, a static navigation environment is built from S-57 chart information, and a Voronoi diagram and an improved A* algorithm are used to obtain an energy-saving optimal static path, which serves as the planned sea route. (2) Because an MASS has small principal dimensions and is therefore easily affected by wind and current, its motion model is built on the MMG model with wind and current taken into account. At the same time, AIS data are used to extract the target ship (manned ship) data. (3) The state space, action space, and reward function of the reinforcement learning algorithm are designed according to the characteristics of actual ship navigation at sea, and the MASS collision avoidance decision model is established on the Dyna-DQN model. Under the DQN algorithm, the agent (the MASS) interacts continuously with the environment, and the real interaction data are used both to iteratively update the collision avoidance strategy and to train the environment model. The environment model then generates simulated experience that further drives the iterative update of the strategy.
The model is verified by simulation in waters near the South China Sea, with the navigation tasks divided into three categories: considering static obstacles only, following the planned sea route while considering static obstacles, and following the planned sea route while considering both static and dynamic obstacles. Repeated simulation experiments show that the MASS completes the navigation task without colliding with static or dynamic obstacles. The proposed method is therefore an effective approach to MASS collision avoidance and can be used in the intelligent collision avoidance module of MASSs.
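
The Dyna loop described in step (3) can be illustrated with a minimal sketch. It is not the paper's implementation: a tabular Q-learner stands in for the DQN, and the environment is a hypothetical one-dimensional corridor rather than a chart-based navigation scene. All names (`dyna_q`, `step`, `planning_steps`) and parameter values are assumptions chosen for illustration. The sketch only shows how real transitions update both the strategy and a learned environment model, and how the model then replays simulated transitions to drive further updates.

```python
import random

def dyna_q(n_states=6, episodes=50, planning_steps=10,
           alpha=0.5, gamma=0.95, epsilon=0.1, seed=0):
    """Tabular Dyna-Q on a hypothetical 1-D corridor (not the paper's setup).

    States are 0..n_states-1; action 0 moves left, action 1 moves right.
    Reaching the last state gives reward 1 and ends the episode.
    """
    rng = random.Random(seed)
    actions = (0, 1)
    goal = n_states - 1
    q = {(s, a): 0.0 for s in range(n_states) for a in actions}
    model = {}  # learned environment model: (state, action) -> (reward, next_state)

    def step(s, a):
        # Real environment (assumed deterministic for this sketch).
        s2 = max(0, s - 1) if a == 0 else min(goal, s + 1)
        return (1.0 if s2 == goal else 0.0), s2

    def update(s, a, r, s2):
        # One-step Q-learning update; no bootstrapping from the terminal state.
        target = r if s2 == goal else r + gamma * max(q[(s2, b)] for b in actions)
        q[(s, a)] += alpha * (target - q[(s, a)])

    def choose(s):
        # Epsilon-greedy action selection with random tie-breaking.
        if rng.random() < epsilon:
            return rng.choice(actions)
        best = max(q[(s, b)] for b in actions)
        return rng.choice([b for b in actions if q[(s, b)] == best])

    for _ in range(episodes):
        s = 0
        while s != goal:
            a = choose(s)
            r, s2 = step(s, a)           # real interaction with the environment
            update(s, a, r, s2)          # strategy update from real data
            model[(s, a)] = (r, s2)      # environment-model training from real data
            for _ in range(planning_steps):
                ps, pa = rng.choice(list(model))
                pr, ps2 = model[(ps, pa)]
                update(ps, pa, pr, ps2)  # strategy update from simulated data
            s = s2
    return q
```

Calling `dyna_q()` returns the learned action values; in this toy corridor the greedy action in every non-goal state should be "move right". In a Dyna-DQN setting, the table `q` would be replaced by a neural network and `model` by a learned transition predictor, but the alternation between real and simulated updates keeps the same shape.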
