Abstract:Navigating complex indoor environments requires a deep understanding of the space the robotic agent is acting into to correctly inform the navigation process of the agent towards the goal location. In recent learning-based navigation approaches, the scene understanding and navigation abilities of the agent are achieved simultaneously by collecting the required experience in simulation. Unfortunately, even if simulators represent an efficient tool to train navigation policies, the resulting models often fail when transferred into the real world. One possible solution is to provide the navigation model with mid-level visual representations containing important domain-invariant properties of the scene. But, what are the best representations that facilitate the transfer of a model to the real-world? How can they be combined? In this work we address these issues by proposing a benchmark of Deep Learning architectures to combine a range of mid-level visual representations, to perform a PointGoal navigation task following a Reinforcement Learning setup. All the proposed navigation models have been trained with the Habitat simulator on a synthetic office environment and have been tested on the same real-world environment using a real robotic platform. To efficiently assess their performance in a real context, a validation tool has been proposed to generate realistic navigation episodes inside the simulator. Our experiments showed that navigation models can benefit from the multi-modal input and that our validation tool can provide good estimation of the expected navigation performance in the real world, while saving time and resources. The acquired synthetic and real 3D models of the environment, together with the code of our validation tool built on top of Habitat, are publicly available at the following link: <a class="link-external link-https" href="https://iplab.dmi.unict.it/EmbodiedVN/" rel="external noopener nofollow">this https URL</a>

Learning to Navigate in Complex Environments

Teaching Agents how to Map: Spatial Reasoning for Multi-Object Navigation

Learning Dynamic Cognitive Map with Autonomous Navigation

End-to-End Navigation in Unknown Environments using Neural Networks

Building Intelligent Autonomous Navigation Agents

Multi-Object Navigation with dynamically learned neural implicit representations

Visual Navigation with Multiple Goals Based on Deep Reinforcement Learning

Image-based Navigation in Real-World Environments via Multiple Mid-level Representations: Fusion Models, Benchmark and Efficient Evaluation

Multi-Object Navigation in real environments using hybrid policies

Learning Autonomous Navigation in Unmapped and Unknown Environments

Investigating Navigation Strategies in the Morris Water Maze through Deep Reinforcement Learning

A Role of Environmental Complexity on Representation Learning in Deep Reinforcement Learning Agents

Navigational Behavior of Humans and Deep Reinforcement Learning Agents

Subgoal-Driven Navigation in Dynamic Environments Using Attention-Based Deep Reinforcement Learning

Deep Reinforcement Learning for Navigation in AAA Video Games

Learning Exploration Policies for Navigation

Learning with a Mole: Transferable latent spatial representations for navigation without reconstruction

Learning to navigate efficiently and precisely in real environments

Autonomous Navigation in Complex Environments

Hierarchical Representations and Explicit Memory: Learning Effective Navigation Policies on 3D Scene Graphs using Graph Neural Networks

Learning to Navigate in a VUCA Environment: Hierarchical Multi-expert Approach