Learning a Semantic Prior for Guided Navigation

Yi Wu, Yuxin Wu, Georgia Gkioxari, Yuandong Tian, Aviv Tamar, Stuart J. Russell
2018-01-01
Abstract: Learning generalizable agents that can adapt to unseen environments remains an open problem in reinforcement learning. We consider visual navigation and address this problem by utilizing latent semantic regularity in human-designed 3D environments, aiming for generalization across scenarios that are visually diverse but semantically consistent. During training, the agent learns subpolicies to reach different semantic concepts, such as ‘move towards the kitchen’, and a prior distribution over their pairwise relationships, such as ‘kitchen is close to dining room’, in the form of a probabilistic graphical model. When testing on new scenarios, the agent dynamically updates its belief of the underlying semantic relationships during exploration and plans its route accordingly towards the final target in an interpretable manner. Our guided navigation method outperforms strong baselines which do not explicitly plan using the semantic content.
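The belief-update-then-plan loop described in the abstract can be illustrated with a toy sketch. Everything below is an assumption made for illustration and is not taken from the paper: each pair of rooms carries an independent probability of being directly reachable, an exploration outcome is treated as a noisy binary observation with made-up likelihoods (0.9 and 0.1), the posterior follows from Bayes' rule, and the route is the path that maximizes the product of edge probabilities (Dijkstra on negative log-probabilities). The function names `update_belief` and `most_likely_path` are hypothetical.

```python
import heapq
import math


def update_belief(prior, observed, p_obs_if_edge=0.9, p_obs_if_no_edge=0.1):
    """Posterior P(edge exists | observation) for one room pair, via Bayes' rule.

    The two likelihood values are illustrative assumptions, not learned quantities.
    """
    like_edge = p_obs_if_edge if observed else 1.0 - p_obs_if_edge
    like_no_edge = p_obs_if_no_edge if observed else 1.0 - p_obs_if_no_edge
    evidence = like_edge * prior + like_no_edge * (1.0 - prior)
    return like_edge * prior / evidence


def most_likely_path(belief, start, goal):
    """Return the path maximizing the product of edge beliefs, or None."""
    # Build an undirected graph whose edge weights are -log(probability),
    # so that the shortest path corresponds to the most probable route.
    graph = {}
    for (a, b), p in belief.items():
        if p > 0.0:
            w = -math.log(p)
            graph.setdefault(a, []).append((b, w))
            graph.setdefault(b, []).append((a, w))

    best = {start: 0.0}
    parent = {}
    heap = [(0.0, start)]
    while heap:
        cost, node = heapq.heappop(heap)
        if node == goal:
            break
        if cost > best.get(node, math.inf):
            continue  # stale heap entry
        for nxt, w in graph.get(node, []):
            new_cost = cost + w
            if new_cost < best.get(nxt, math.inf):
                best[nxt] = new_cost
                parent[nxt] = node
                heapq.heappush(heap, (new_cost, nxt))

    if goal not in best:
        return None
    path = [goal]
    while path[-1] != start:
        path.append(parent[path[-1]])
    return path[::-1]


# Hypothetical prior over pairwise room relationships.
belief = {
    ("living room", "kitchen"): 0.3,
    ("living room", "dining room"): 0.8,
    ("dining room", "kitchen"): 0.9,
}
plan_before = most_likely_path(belief, "living room", "kitchen")

# The agent fails to reach the dining room from the living room, so the
# belief in that edge drops and the plan is recomputed.
belief[("living room", "dining room")] = update_belief(0.8, observed=False)
plan_after = most_likely_path(belief, "living room", "kitchen")
```

Under these made-up numbers, the initial plan goes through the dining room (0.8 × 0.9 = 0.72 beats 0.3), while after the failed attempt the edge belief falls to roughly 0.31 and the direct route becomes the more probable one. Because the plan is an explicit sequence of named rooms, it stays readable at every step, which is the sense of "interpretable" this sketch tries to convey.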