Towards Target-Driven Visual Navigation in Indoor Scenes via Generative Imitation Learning

Qiaoyun Wu, Xiaoxi Gong, Kai Xu, Dinesh Manocha, Jingxuan Dong, Jun Wang
DOI: https://doi.org/10.1109/lra.2020.3036597
IF: 5.2
2021-01-01
IEEE Robotics and Automation Letters
Abstract: We present a target-driven navigation system to improve mapless visual navigation in indoor scenes. Our method takes a multi-view observation of a robot and a target image as inputs at each time step to provide a sequence of actions that move the robot to the target without relying on odometry or GPS at runtime. The system is learned by optimizing a combinational objective encompassing three key designs. First, we propose that an agent conceives the next observation before making an action decision. This is achieved by learning a variational generative module from expert demonstrations. We then propose predicting static collision in advance, as an auxiliary task to improve safety during navigation. Moreover, to alleviate the training data imbalance problem of termination action prediction, we also introduce a target checking module to differentiate from augmenting navigation policy with a termination action. The three proposed designs all contribute to the improved training data efficiency, static collision avoidance, and navigation generalization performance, resulting in a novel target-driven mapless navigation system. Through experiments on a TurtleBot, we provide evidence that our model can be integrated into a robotic system and navigate in the real world. Videos and models can be found in the supplementary material.
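The runtime behaviour described in the abstract (observe, check whether the target has been reached, otherwise pick an action) can be illustrated with a minimal sketch. It is not the authors' implementation: the function `navigate`, its callables (`get_observation`, `policy`, `target_checker`, `step`), and the `threshold` and `max_steps` parameters are all hypothetical names chosen for illustration. The one idea it takes from the abstract is that termination is decided by a separate target checking module rather than by a "stop" action inside the navigation policy.

```python
def navigate(get_observation, target, policy, target_checker, step,
             max_steps=100, threshold=0.5):
    """Run a target-driven control loop (illustrative sketch only).

    get_observation: () -> observation (e.g. multi-view images); hypothetical
    policy:          (observation, target) -> action; hypothetical
    target_checker:  (observation, target) -> score in [0, 1]; hypothetical
    step:            (action) -> None, executes the action on the robot
    Returns (actions_taken, reached_target).
    """
    actions = []
    for _ in range(max_steps):
        obs = get_observation()
        # Termination is decided by a separate checker, so the policy
        # itself never has to predict a rare "stop" action.
        if target_checker(obs, target) >= threshold:
            return actions, True
        action = policy(obs, target)
        step(action)
        actions.append(action)
    return actions, False
```

Because the checker is a separate callable, it could in principle be trained on its own balanced set of at-target and not-at-target examples, which is one plausible reading of how the design sidesteps the imbalance problem the abstract mentions.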
robotics
What problem does this paper attempt to address?
The paper addresses target-driven navigation for robots in unexplored indoor scenes. Its specific objectives are:

1. **Improving Mapless Visual Navigation**: Proposing a navigation system that lets a robot navigate indoor environments autonomously without relying on maps, odometry, or GPS at runtime.
2. **Multimodal Decision Processing**: Introducing a variational generative module, learned from expert demonstrations, to handle multimodality in navigation decisions and thereby improve training data efficiency and decision quality.
3. **Predicting Collisions in Advance**: Predicting static collisions as an auxiliary task to improve safety during navigation.
4. **Optimizing Target Detection**: Introducing a target checking module that determines whether the robot has reached the target location, instead of adding a termination action to the navigation policy, which avoids the training data imbalance that termination action prediction suffers from.
5. **Comprehensive Performance Enhancement**: Combining these designs to improve training data efficiency, static collision avoidance, and generalization across scenes and targets.
6. **Experimental Validation**: Showing through experiments on a TurtleBot that the proposed model can be integrated into a robotic system and navigate in real-world environments.

In summary, the paper develops a target-driven navigation method based on generative imitation learning for autonomous robot navigation in unknown indoor environments, and improves navigation performance through the three designs listed above.
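To make the idea of one combined training objective more concrete, the sketch below adds up one loss term per design: an imitation term on the expert action, a reconstruction term and a KL term for the variational generative module, and binary cross-entropy terms for collision prediction and target checking. Everything specific here is an assumption made for illustration: the dictionary keys, the scalar collision output, the unit loss weights, and the use of mean squared error on a next-observation feature vector are not taken from the paper, which may define these terms differently.

```python
import math


def gaussian_kl(mu, logvar):
    """KL(N(mu, diag(exp(logvar))) || N(0, I)), summed over latent dimensions."""
    return 0.5 * sum(math.exp(lv) + m * m - 1.0 - lv for m, lv in zip(mu, logvar))


def cross_entropy(probs, label):
    """Negative log-likelihood of the expert action under the predicted distribution."""
    return -math.log(max(probs[label], 1e-12))


def binary_cross_entropy(p, label):
    """Binary cross-entropy for a single probability p and a 0/1 label."""
    p = min(max(p, 1e-12), 1.0 - 1e-12)
    return -(label * math.log(p) + (1 - label) * math.log(1.0 - p))


def mse(pred, target):
    """Mean squared error between two equal-length feature vectors."""
    return sum((a - b) ** 2 for a, b in zip(pred, target)) / len(pred)


def combined_loss(pred, expert, weights=(1.0, 1.0, 1.0, 1.0)):
    """Weighted sum of the per-design loss terms (weights are assumed, not from the paper)."""
    w_gen, w_kl, w_col, w_tgt = weights
    action = cross_entropy(pred["action_probs"], expert["action"])        # imitation
    gen = mse(pred["next_obs_feat"], expert["next_obs_feat"])             # generated next observation
    kl = gaussian_kl(pred["mu"], pred["logvar"])                          # variational regularizer
    col = binary_cross_entropy(pred["collision_prob"], expert["collision"])   # auxiliary collision task
    tgt = binary_cross_entropy(pred["at_target_prob"], expert["at_target"])   # target checking
    return action + w_gen * gen + w_kl * kl + w_col * col + w_tgt * tgt
```

In practice these terms would be computed on batched tensors in a deep learning framework; plain Python lists are used here only to keep the sketch self-contained.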