A knowledge-free path planning approach for smart ships based on reinforcement learning

Chen Chen, Xian-Qiao Chen, Feng Ma, Xiao-Jun Zeng, Jin Wang

DOI: https://doi.org/10.1016/j.oceaneng.2019.106299

IF: 5

2019-10-01

Ocean Engineering

Abstract: The autonomous navigation of smart ships must account for their large inertia and comply with complex existing rules. A smart ship has to achieve autonomous driving in place of manual operation, which consists of path planning and control. Toward this goal, this research proposes a path planning and manipulation approach based on Q-learning, which can drive a cargo ship by itself without requiring any input from human experience. First, a ship is modelled with the Nomoto model in a simulated waterway. Then, distances, obstacles and prohibited areas are expressed as rewards or punishments, which are used to judge the performance of the ship's manipulation decisions. Subsequently, Q-learning is introduced to learn the action–reward model, and the learning outcome is used to manipulate the ship's movement. By pursuing higher reward values, the ship can find an appropriate path or navigation strategy by itself. After a sufficient number of training rounds, a convincing path and manipulation strategy are likely to be produced. A comparison of the proposed approach with existing methods shows that it is more effective in self-learning and continuous optimisation, and therefore closer to human manoeuvring.
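To make the ship model mentioned in the abstract concrete, the following is a minimal sketch of a first-order Nomoto model, T * dr/dt + r = K * delta, integrated with a simple Euler step. The gain K, time constant T, speed U, time step dt and the names ShipState and nomoto_step are illustrative assumptions, not values or code from the paper.

```python
import math
from dataclasses import dataclass


@dataclass
class ShipState:
    x: float = 0.0    # position along the x axis (m)
    y: float = 0.0    # position along the y axis (m)
    psi: float = 0.0  # heading (rad), measured from the x axis
    r: float = 0.0    # yaw rate (rad/s)


def nomoto_step(state, delta, K=0.1, T=10.0, U=5.0, dt=1.0):
    """Advance the ship one Euler step under rudder angle delta (rad).

    First-order Nomoto model: T * dr/dt + r = K * delta.
    K, T, U and dt are hypothetical values chosen for illustration only.
    """
    r_dot = (K * delta - state.r) / T      # yaw acceleration
    r = state.r + r_dot * dt               # yaw rate responds with a lag
    psi = state.psi + r * dt               # heading integrates the yaw rate
    x = state.x + U * math.cos(psi) * dt   # constant forward speed U
    y = state.y + U * math.sin(psi) * dt
    return ShipState(x, y, psi, r)
```

With a constant rudder angle, the yaw rate in this sketch approaches K * delta only gradually, at a pace set by T. That lag is the inertia effect a planner for a large ship has to respect.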

engineering, civil, ocean, marine, oceanography

What problem does this paper attempt to address?

This paper addresses path planning for the autonomous navigation of smart ships. Specifically, it examines how reinforcement learning, in particular Q-learning, can enable a cargo ship to drive itself without any input from human experience. The task has two parts, path planning and control, and must accommodate the large inertia of cargo ships and complex navigation rules while coping with challenges such as dynamic environments, insufficient power and perceptual uncertainty. Traditional path planning methods such as the A* algorithm, the artificial potential field (APF) method and rapidly-exploring random trees (RRT) perform well for land robots but are often ill-suited to navigation that must account for the dynamic characteristics of cargo ships. The paper therefore proposes a path planning method based on Q-learning: through extensive training in a simulated environment, the smart ship can autonomously find a suitable path or navigation strategy, bringing its navigation closer to human manoeuvring.
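As a rough illustration of the learning loop described above, the sketch below runs tabular Q-learning on a small grid "waterway". It is not the paper's setup: the grid size, obstacle positions, reward values (a step cost standing in for distance, a penalty for obstacles, a bonus for the goal), the hyperparameters and all function names are assumptions made for this example, and the ship dynamics are omitted.

```python
import random

SIZE = 5                                      # hypothetical 5x5 waterway grid
START, GOAL = (0, 0), (4, 4)
OBSTACLES = {(1, 1), (2, 3), (3, 1)}          # hypothetical prohibited cells
ACTIONS = [(1, 0), (-1, 0), (0, 1), (0, -1)]  # moves along x and y


def step(state, action):
    """Return (next_state, reward, done) for one move on the grid."""
    nx, ny = state[0] + action[0], state[1] + action[1]
    if not (0 <= nx < SIZE and 0 <= ny < SIZE):
        return state, -1.0, False             # blocked by the boundary
    nxt = (nx, ny)
    if nxt in OBSTACLES:
        return nxt, -100.0, True              # punishment: collision
    if nxt == GOAL:
        return nxt, 100.0, True               # reward: destination reached
    return nxt, -1.0, False                   # step cost favours short paths


def train(episodes=3000, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Learn a Q-table with epsilon-greedy exploration."""
    rng = random.Random(seed)
    q = {}                                    # (state, action index) -> value
    n = len(ACTIONS)
    for _ in range(episodes):
        s = START
        for _ in range(100):                  # cap the episode length
            if rng.random() < epsilon:
                a = rng.randrange(n)          # explore
            else:
                a = max(range(n), key=lambda i: q.get((s, i), 0.0))
            s2, reward, done = step(s, ACTIONS[a])
            best_next = 0.0 if done else max(q.get((s2, i), 0.0) for i in range(n))
            old = q.get((s, a), 0.0)
            # Q-learning update: move Q(s, a) toward reward + gamma * max Q(s2, .)
            q[(s, a)] = old + alpha * (reward + gamma * best_next - old)
            s = s2
            if done:
                break
    return q


def greedy_path(q, max_steps=50):
    """Follow the highest-valued action from START until the episode ends."""
    s, path = START, [START]
    for _ in range(max_steps):
        a = max(range(len(ACTIONS)), key=lambda i: q.get((s, i), 0.0))
        s, _, done = step(s, ACTIONS[a])
        path.append(s)
        if done:
            break
    return path
```

Calling greedy_path(train()) returns the sequence of cells the learned policy visits. No route is supplied by hand; the path comes only from the rewards and punishments, which is the sense in which such an approach is "knowledge-free".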

An Autonomous Path Planning Model for Unmanned Ships Based on Deep Reinforcement Learning

Path Planning of Maritime Autonomous Surface Ships in Unknown Environment with Reinforcement Learning

Deep Reinforcement Learning-Based Path Control and Optimization for Unmanned Ships

Improved reinforcement learning for collision-free local path planning of dynamic obstacle

Deep Reinforcement Learning Based Path Planning and Collision Avoidance for Smart Ships in Complex Environments

Knowledge transfer enabled reinforcement learning for efficient and safe autonomous ship collision avoidance

A novel path planning approach for smart cargo ships based on anisotropic fast marching

AUV Path Planning with Kinematic Constraints in Unknown Environment Using Reinforcement Learning

Long-Range Risk-Aware Path Planning for Autonomous Ships in Complex and Dynamic Environments

Soft Actor-Critic and Risk Assessment-Based Reinforcement Learning Method for Ship Path Planning

Safety Aware Autonomous Path Planning Using Model Predictive Reinforcement Learning for Inland Waterways

Neural Network Model-Based Reinforcement Learning Control for AUV 3-D Path Following

Spatial-temporal recurrent reinforcement learning for autonomous ships

A Method for Coastal Global Route Planning of Unmanned Ships Based on Human-like Thinking

Autonomous ship navigation with an enhanced safety collision avoidance technique

High-Level Path Planning for an Autonomous Sailboat Robot Using Q-Learning

A path planning strategy unified with a COLREGS collision avoidance function based on deep reinforcement learning and artificial potential field

Port Channel Navigation Subjected to Environmental Conditions Using Reinforcement Learning

Collision avoidance for autonomous ship using deep reinforcement learning and prior-knowledge-based approximate representation

Path Planning Algorithm for Unmanned Surface Vessel Based on Multiobjective Reinforcement Learning