Depth-Guided AdaIN and Shift Attention Network for Vision-And-Language Navigation

Qiang Sun,Yifeng Zhuang,Zhengqing Chen,Yanwei Fu,Xiangyang Xue
DOI: https://doi.org/10.1109/ICME51207.2021.9428422
2021-01-01
Abstract:Visual Language Navigation (VLN) is the grand goal of AI, which enables the agent to act by the language instructions from humans. In VLN task, the agent learns to search for a specific region described by the instructions in the training environments, and performs the navigation in the unseen environments. Normally, there exists a large domain gap be-tween the seen and unseen environments. Numero...
What problem does this paper attempt to address?