Semantic Path Planning for Indoor Navigation Tasks Using Multi-View Context and Prior Knowledge.

Jianbing Wu,Weibo Huang,Guoliang Hua,Wanruo Zhang,Risheng Kang,Hong Liu
DOI: https://doi.org/10.1587/transinf.2022dlp0033
2023-01-01
IEICE Transactions on Information and Systems
Abstract:Recently, deep reinforcement learning (DRL) methods have significantly improved the performance of target-driven indoor nav-igation tasks. However, the rich semantic information of environments is still not fully exploited in previous approaches. In addition, existing meth-ods usually tend to overfit on training scenes or objects in target-driven navigation tasks, making it hard to generalize to unseen environments. Hu-man beings can easily adapt to new scenes as they can recognize the objects they see and reason the possible locations of target objects using their expe-rience. Inspired by this, we propose a DRL-based target-driven navigation model, termed MVC-PK, using Multi-View Context information and Prior semantic Knowledge. It relies only on the semantic label of target objects and allows the robot to find the target without using any geometry map. To perceive the semantic contextual information in the environment, ob-ject detectors are leveraged to detect the objects present in the multi-view observations. To enable the semantic reasoning ability of indoor mobile robots, a Graph Convolutional Network is also employed to incorporate prior knowledge. The proposed MVC-PK model is evaluated in the AI2-THOR simulation environment. The results show that MVC-PK (1) sig-nificantly improves the cross-scene and cross-target generalization ability, and (2) achieves state-of-the-art performance with 15.2% and 11.0% in-crease in Success Rate (SR) and Success weighted by Path Length (SPL), respectively.
What problem does this paper attempt to address?