Parallel Distributional Deep Reinforcement Learning for Mapless Navigation of Terrestrial Mobile Robots

Victor Augusto Kich,Alisson Henrique Kolling,Junior Costa de Jesus,Gabriel V. Heisler,Hiago Jacobs,Jair Augusto Bottega,André L. da S. Kelbouscas,Akihisa Ohya,Ricardo Bedin Grando,Paulo Lilles Jorge Drews-Jr,Daniel Fernando Tello Gamarra
2024-09-01
Abstract:This paper introduces novel deep reinforcement learning (Deep-RL) techniques using parallel distributional actor-critic networks for navigating terrestrial mobile robots. Our approaches use laser range findings, relative distance, and angle to the target to guide the robot. We trained agents in the Gazebo simulator and deployed them in real scenarios. Results show that parallel distributional Deep-RL algorithms enhance decision-making and outperform non-distributional and behavior-based approaches in navigation and spatial generalization.
Robotics
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to achieve map - free navigation of ground - moving robots in complex environments. Specifically, the researchers proposed a new deep reinforcement learning (Deep - RL) technique, using parallel - distributed actor - critic networks to guide robot navigation. These methods utilize laser ranging data, the relative distance and angle of the target to guide the robot, and train agents in the Gazebo simulator, and then deploy them in real - world scenarios. The main contributions of the paper include: 1. Proposing two new distributed deep reinforcement learning methods, namely parallel - distributed deterministic reinforcement learning (PDDRL) and parallel - distributed stochastic reinforcement learning (PDSRL), for improving goal - oriented map - free navigation based on a simple range - sensing architecture. 2. Demonstrating the feasibility of these methods through simulation - to - real evaluation, solving challenges such as inaccuracy and delay. 3. Evidence shows that the stochastic actor - critic technique using prioritized experience replay is superior to non - distributed techniques and classical algorithms. This is the first extensive simulation - to - real map - free navigation evaluation of ground - moving robots using parallel - distributed deep reinforcement learning methods. Through these contributions, the paper aims to improve the autonomous navigation ability of ground - moving robots in complex environments, especially in the absence of pre - constructed maps.