Motion Planning Using Reinforcement Learning Method for Underactuated Ship Berthing.

Haoran Zhang,Chenkun Yin,Yanxin Zhang
DOI: https://doi.org/10.1109/icca51439.2020.9264562
2020-01-01
Abstract:This paper proposes a novel motion planning method for underactuated ship berthing using reinforcement learning (RL) technique. The berthing motion planning problem is formulated as a Markov Decision Process, where a specified reward function is designed for the accurate berthing task. The problem is addressed by a state-of-art RL algorithm, Twin Delayed Deep Deterministic Policy Gradient (TD3). The generated trajectories are feasible for the ship to accomplish berthing task, since the system constraints are fully taken into consideration when the trained agent interacts with environment by RL. Simulation results verify the effectiveness of RL based motion planning method, and the advantage of TD3 is shown by comparison with Deep Deterministic Policy Gradient for the same task.
What problem does this paper attempt to address?