Collision Anticipation via Deep Reinforcement Learning for Visual Navigation

Eduardo Gutiérrez-Maestro,Roberto J. López-Sastre,Saturnino Maldonado-Bascón
DOI: https://doi.org/10.1007/978-3-030-31332-6_34
2019-01-01
Pattern Recognition and Image Analysis
Abstract:Visual navigation is the ability of an autonomous agent to find its way in a large and complex environment based on visual information. It is indeed a fundamental problem in computer vision and robotics. In this paper, we propose a deep reinforcement learning approach which is able to learn to navigate a scene to reach a given visual target, but anticipating the possible collisions with the environment. Technically, we propose a map-less-based model, which follows an actor-critic reinforcement learning method where the reward function has been designed to be collision aware. We offer a thorough experimental evaluation of our solution in the AI2-THOR virtual environment, where the results show that our proposed method: (1) improves the state of the art in terms of number of steps and collisions; (2) is able to converge faster than a model which does not care about the collisions, simply searching for the shortest paths; and (3) offers an interesting generalization capability to reach visual targets that have never been seen during training.
What problem does this paper attempt to address?