Subgoal-Driven Navigation in Dynamic Environments Using Attention-Based Deep Reinforcement Learning

Jorge de Heuvel, Weixian Shi, Xiangyu Zeng, Maren Bennewitz
2023-03-03
Abstract: Collision-free, goal-directed navigation in environments containing unknown static and dynamic obstacles is still a great challenge, especially when manual tuning of navigation policies or costly motion prediction needs to be avoided. In this paper, we therefore propose a subgoal-driven hierarchical navigation architecture that is trained with deep reinforcement learning and decouples obstacle avoidance and motor control. In particular, we separate the navigation task into the prediction of the next subgoal position for avoiding collisions while moving toward the final target position, and the prediction of the robot's velocity controls. By relying on 2D lidar, our method learns to avoid obstacles while still achieving goal-directed behavior as well as to generate low-level velocity control commands to reach the subgoals. In our architecture, we apply the attention mechanism on the robot's 2D lidar readings and compute the importance of lidar scan segments for avoiding collisions. As we show in simulated and real-world experiments with a Turtlebot robot, our proposed method leads to smooth and safe trajectories among humans and significantly outperforms a state-of-the-art approach in terms of success rate. A supplemental video describing our approach is available online.
Robotics
What problem does this paper attempt to address?
This paper addresses collision-free, goal-directed navigation in environments containing unknown static and dynamic obstacles. It proposes an attention-based deep reinforcement learning method that avoids both manual tuning of navigation policies and costly motion prediction. The navigation task is decomposed into two parts: predicting the next subgoal position, which avoids collisions while moving the robot toward the final target, and predicting the robot's velocity controls, which drive it to that subgoal. With this hierarchical architecture, the method learns from 2D lidar readings to reach the target safely among unknown obstacles, and an attention mechanism computes the importance of individual lidar scan segments for collision avoidance. In simulated and real-world experiments with a Turtlebot robot, the method produces smooth and safe trajectories among humans and significantly outperforms a state-of-the-art approach in terms of success rate.
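The two-level decomposition described above can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the function names (`segment_attention`, `next_subgoal`, `velocity_command`), the segment score (negative minimum range), the straight-line subgoal rule, and the proportional controller are hand-written stand-ins for components that the paper trains with deep reinforcement learning. None of it is taken from the authors' implementation.

```python
import math


def segment_attention(scan, num_segments):
    """Split a 2D lidar scan into equal segments and return softmax weights.

    ASSUMPTION: the score (negative minimum range) is a hand-written
    placeholder for a learned attention score, so that segments containing
    closer obstacles receive more weight. Trailing readings that do not
    fill a whole segment are ignored.
    """
    size = len(scan) // num_segments
    if size == 0:
        raise ValueError("scan has fewer readings than segments")
    scores = [-min(scan[i * size:(i + 1) * size]) for i in range(num_segments)]
    peak = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def next_subgoal(goal_xy, max_dist=1.0):
    """High-level stand-in: a subgoal at most max_dist toward the goal.

    ASSUMPTION: coordinates are in the robot frame (x forward, y left).
    This placeholder ignores obstacles entirely; a learned high-level
    policy would also condition on the attention-weighted scan.
    """
    gx, gy = goal_xy
    dist = math.hypot(gx, gy)
    if dist <= max_dist:
        return (gx, gy)
    scale = max_dist / dist
    return (gx * scale, gy * scale)


def velocity_command(subgoal_xy, v_max=0.5, w_max=1.0):
    """Low-level stand-in: proportional controller toward the subgoal.

    Returns (linear velocity, angular velocity). ASSUMPTION: v_max and
    w_max are arbitrary illustrative limits, not values from the paper.
    """
    sx, sy = subgoal_xy
    heading = math.atan2(sy, sx)  # bearing of the subgoal in the robot frame
    w = max(-w_max, min(w_max, heading))  # turn toward the subgoal, clipped
    v = v_max * max(0.0, math.cos(heading))  # slow down when facing away
    return v, w


if __name__ == "__main__":
    scan = [2.0] * 4 + [0.5] * 4  # obstacle is closer in the second half
    print(segment_attention(scan, num_segments=2))
    subgoal = next_subgoal((3.0, 4.0))
    print(subgoal, velocity_command(subgoal))
```

The point of the sketch is the interface between the levels: the high level emits only a subgoal position, and the low level turns that position into velocity commands, so each part can be replaced or trained separately.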