Abstract:Reinforcement learning (RL) has shown superior performance in solving sequential decision problems. In recent years, RL is gradually being used to solve unmanned driving collision avoidance decision-making problems in complex scenarios. However, ships encounter many scenarios, and the differences in scenarios will seriously hinder the application of RL in collision avoidance at sea. Moreover, the iterative speed of trial-and-error learning for RL in multi-ship encounter scenarios is slow. To solve this problem, this study develops a novel intelligent collision avoidance algorithm based on approximate representation reinforcement learning (AR-RL) to realize the collision avoidance of maritime autonomous surface ships (MASS) in a continuous state space environment involving interactive learning capability like a crew in navigation situation. The new algorithm uses an approximate representation model to deal with the optimization of collision avoidance strategies in a dynamic target encounter situation. The model is combined with prior knowledge and International Regulations for Preventing Collisions at Sea (COLREGs) for optimal performance. This is followed by a design of an online solution to a value function approximation model based on gradient descent. This approach can solve the problem of large-scale collision avoidance policy learning in static-dynamic obstacles mixed environment. Finally, algorithm tests were constructed though two scenarios (i.e., the coastal static obstacle environment and the static-dynamic obstacles mixed environment) using Tianjin Port as an example and compared with multiple groups of algorithms. The results show that the algorithm can improve the large-scale learning efficiency of continuous state space of dynamic obstacle environment by approximate representation. At the same time, the MASS can efficiently and safely avoid obstacles enroute to reaching its target destination. It therefore makes significant contributions to ensuring safety at sea in a mixed traffic involving both manned and MASS in near future.

Autonomous spacecraft collision avoidance with a variable number of space debris based on safe reinforcement learning

Autonomous Spacecraft Collision Avoidance with Multiple Space Debris Based on Reinforcement Learning

Spacecraft Attitude Maneuver Planning Based on Deep Reinforcement Learning under Complex Constraints

Spacecraft Autonomous Decision-Planning for Collision Avoidance: a Reinforcement Learning Approach

A Reinforcement Learning-based Decentralized Method of Avoiding Multi-UAV Collision in 3-D Airspace

Research on collision avoidance algorithm of unmanned surface vehicle based on deep reinforcement learning

Collision avoidance decision-making strategy for multiple USVs based on Deep Reinforcement Learning algorithm

Coordinated Control Based on Reinforcement Learning for Dual-Arm Continuum Manipulators in Space Capture Missions

Revisiting Space Mission Planning: A Reinforcement Learning-Guided Approach for Multi-Debris Rendezvous

Mission Planning on Autonomous Avoidance for Spacecraft Confronting Orbital Debris

Proximal policy optimization with reciprocal velocity obstacle based collision avoidance path planning for multi-unmanned surface vehicles

A partially observable multi-ship collision avoidance decision-making model based on deep reinforcement learning

Multi-UAV simultaneous target assignment and path planning based on deep reinforcement learning in dynamic multiple obstacles environments

Obstacle Avoidance for UAS in Continuous Action Space Using Deep Reinforcement Learning

AI-Driven Risk-Aware Scheduling for Active Debris Removal Missions

A COLREGs-Compliant Deep Reinforcement Learning Approach

Active Debris Removal Mission Planning Method Based on Machine Learning

Collision avoidance for autonomous ship using deep reinforcement learning and prior-knowledge-based approximate representation

Deep Reinforcement Learning Based Path Planning and Collision Avoidance for Smart Ships in Complex Environments

Path planning and dynamic collision avoidance algorithm under COLREGs via deep reinforcement learning

Autonomous obstacle avoidance strategies in the mission of large space debris removal using potential function