Abstract:Extensive studies have shown that many animals’ capability of forming spatial representations for self-localization, path planning, and navigation relies on the functionalities of place and head-direction (HD) cells in the hippocampus. Although there are numerous hippocampal modeling approaches, only a few span the wide functionalities ranging from processing raw sensory signals to planning and action generation. This paper presents a vision-based navigation system that involves generating place and HD cells through learning from visual images, building topological maps based on learned cell representations and performing navigation using hierarchical reinforcement learning. First, place and HD cells are trained from sequences of visual stimuli in an unsupervised learning fashion. A modified Slow Feature Analysis (SFA) algorithm is proposed to learn different cell types in an intentional way by restricting their learning to separate phases of the spatial exploration. Then, to extract the encoded metric information from these unsupervised learning representations, a self-organized learning algorithm is adopted to learn over the emerged cell activities and to generate topological maps that reveal the topology of the environment and information about a robot’s head direction, respectively. This enables the robot to perform self-localization and orientation detection based on the generated maps. Finally, goal-directed navigation is performed using reinforcement learning in continuous state spaces which are represented by the population activities of place cells. In particular, considering that the topological map provides a natural hierarchical representation of the environment, hierarchical reinforcement learning (HRL) is used to exploit this hierarchy to accelerate learning. The HRL works on different spatial scales, where a high-level policy learns to select subgoals and a low-level policy learns over primitive actions to specialize on the selected subgoals. Experimental results demonstrate that our system is able to navigate a robot to the desired position effectively, and the HRL shows a much better learning performance than the standard RL in solving our navigation tasks.

Multiple Self-Supervised Auxiliary Tasks for Target-Driven Visual Navigation Using Deep Reinforcement Learning.

Visual Navigation with Multiple Goals Based on Deep Reinforcement Learning

Target-driven Indoor Visual Navigation Using Inverse Reinforcement Learning

Discovering Intrinsic Subgoals for Vision-and-Language Navigation via Hierarchical Reinforcement Learning

Boosting Efficient Reinforcement Learning for Vision-and-Language Navigation with Open-Sourced LLM

Vision-Language Navigation With Self-Supervised Auxiliary Reasoning Tasks

Improving Target-driven Visual Navigation with Attention on 3D Spatial Relationships

Reinforced Cross-Modal Matching and Self-Supervised Imitation Learning for Vision-Language Navigation

Target-driven Visual Navigation in Indoor Scenes Using Reinforcement Learning and Imitation Learning

Unsupervised Reinforcement Learning of Transferable Meta-Skills for Embodied Navigation

Autonomous Multi-View Navigation Via Deep Reinforcement Learning

Skill-Based Hierarchical Reinforcement Learning for Target Visual Navigation

Multigoal Visual Navigation With Collision Avoidance via Deep Reinforcement Learning

Reinforcement Learning-Based Visual Navigation With Information-Theoretic Regularization

Vision-Based Robot Navigation through Combining Unsupervised Learning and Hierarchical Reinforcement Learning

Vision-and-Language Navigation via Latent Semantic Alignment Learning

Learning On-Road Visual Control for Self-Driving Vehicles with Auxiliary Tasks

Multimodal fusion for autonomous navigation via deep reinforcement learning with sparse rewards and hindsight experience replay

Deep Reinforcement Learning Visual Navigation Model Integrating Memory-prediction Mechanism

VANP: Learning Where to See for Navigation with Self-Supervised Vision-Action Pre-Training

Visual Hindsight Self-Imitation Learning for Interactive Navigation