Using Deep Reinforcement Learning Methods for Autonomous Vessels in 2D Environments

Mohammad Etemad,Nader Zare,Mahtab Sarvmaili,Amilcar Soares,Bruno Brandoli Machado,Stan Matwin
DOI: https://doi.org/10.48550/arXiv.2003.10249
2020-03-23
Abstract:Unmanned Surface Vehicles technology (USVs) is an exciting topic that essentially deploys an algorithm to safely and efficiently performs a mission. Although reinforcement learning is a well-known approach to modeling such a task, instability and divergence may occur when combining off-policy and function approximation. In this work, we used deep reinforcement learning combining Q-learning with a neural representation to avoid instability. Our methodology uses deep q-learning and combines it with a rolling wave planning approach on agile methodology. Our method contains two critical parts in order to perform missions in an unknown environment. The first is a path planner that is responsible for generating a potential effective path to a destination without considering the details of the root. The latter is a decision-making module that is responsible for short-term decisions on avoiding obstacles during the near future steps of USV exploitation within the context of the value function. Simulations were performed using two algorithms: a basic vanilla vessel navigator (VVN) as a baseline and an improved one for the vessel navigator with a planner and local view (VNPLV). Experimental results show that the proposed method enhanced the performance of VVN by 55.31 on average for long-distance missions. Our model successfully demonstrated obstacle avoidance by means of deep reinforcement learning using planning adaptive paths in unknown environments.
Machine Learning,Artificial Intelligence
What problem does this paper attempt to address?
This paper attempts to solve the path planning and obstacle avoidance problems faced by unmanned surface vessels (USVs) during mission execution. Specifically, the author aims to enhance the autonomous navigation ability of unmanned vessels in unknown environments by combining deep reinforcement learning and local view strategies. ### Main problems: 1. **Path planning**: How to generate a safe and efficient path from the starting point to the target point. 2. **Obstacle avoidance**: How to dynamically avoid obstacles in an unknown environment and ensure that the vessel reaches the destination safely. ### Solutions: To address the above challenges, the author proposes a new method - **Vessel Navigator with Planner and Local View (VNPLV)**, which utilizes the following techniques: - **Deep Reinforcement Learning (DRL)**: Through the Deep Q - Learning algorithm, the model can make decisions in complex environments. - **Rolling Wave Planning**: This is a concept in agile methodology, combining long - term planning and short - term decision - making to adapt to the dynamic changes of the environment. - **Path simplification**: Use the Ramer - Douglas - Peucker algorithm to reduce redundant points in the path and improve flexibility. - **Local view strategy**: Only focus on the local area around the vessel, reduce the complexity of the state space, and at the same time avoid the influence of distant static obstacles on decision - making. ### Experimental results: Experiments show that the VNPLV method has a 55.31% performance improvement compared to the basic Vanilla Vessel Navigator (VVN) method in long - distance tasks, and can avoid obstacles more effectively and reach the destination. ### Summary: The main contribution of this paper is to develop a new method that combines deep reinforcement learning and local view strategies for path planning and obstacle avoidance of unmanned surface vessels. This method not only improves the autonomous navigation ability of vessels in unknown environments but also shows the potential of applying deep learning techniques in complex marine environments.