Intelligent Navigation Method for Multiple Marine Autonomous Surface Ships Based on Improved PPO Algorithm

Zhewen Cui,Wei Guan,Wenzhe Luo,Xianku Zhang
DOI: https://doi.org/10.1016/j.oceaneng.2023.115783
IF: 5
2023-01-01
Ocean Engineering
Abstract:To achieve the autonomous navigation of multiple Marine Autonomous Surface Ship (MASS), an intelligent planning and decision-making method for MASS based on Rapidly-exploring Random Trees star (RRT-star) and improved Proximal Policy Optimization (PPO) algorithm is proposed. The novelty of the study is: (1) enhancing the traditional PPO algorithm by the Generalized Advantage Estimation (GAE) and the Long Short-Term Memory (LSTM) network, which facilitate accelerated convergence of average reward, predict the state space, and enhance accuracy in estimating the advantage function. (2) proposing a complete reward function, which can not only guide MASS to navigate towards the waypoint, but also ensure that MASS complies with the COLREGs during collision avoidance. It is worth highlighting that the trained network model can be generalized to different scenarios. We attached the trained neural network model to different MASS and simulated it in different open and narrow waters. Even in emergency situations, the method can still make it deviate from COLREGs and make flexible collision avoidance decisions. The results show that this method can handle the multi-MASS encounter situation well and make MASS reach the waypoint safely.
What problem does this paper attempt to address?