Long-Sighted Imitation Learning for Partially Observable Control

Bo Xiong, Fangshi Wang, Chao Yu, Fei Qiao, Yi Yang, Qi Wei, Xin-Jun Liu
DOI: https://doi.org/10.1145/3387304.3387320
2020-01-01
Abstract: Imitation Learning (IL) has facilitated many effective and efficient controllers for autonomous agents. Nevertheless, current methods suffer from severe partial observability problems when given incomplete observations, leading to short-sighted behaviors in decision-making tasks. To overcome these shortcomings, this paper presents a Long-Sighted Imitation Learning approach that expands both visual perception and memory size. First, we utilize a deep siamese network that takes both the current observation and the goal state as input, which is especially effective on goal-oriented tasks. Furthermore, inspired by the success of the Deep Recurrent Q-Network (DRQN), we introduce recurrency into imitation learning by appending a Gated Recurrent Unit (GRU) layer right after the last fully-connected layer. Extensive experiments on goal-oriented navigation tasks demonstrate that our method outperforms current counterparts.
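The architecture described in the abstract can be illustrated with a minimal forward-pass sketch. It is not the authors' implementation. It assumes flat feature vectors in place of images, a single-layer shared encoder, random untrained weights, and a cross-entropy behavior cloning loss, none of which the abstract specifies. The names `LongSightedPolicy` and `behavior_cloning_loss`, and all dimensions, are hypothetical.

```python
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


class LongSightedPolicy:
    """Hypothetical sketch: siamese encoder -> FC -> GRU -> action logits.

    Forward pass only, with random weights and biases omitted for brevity.
    """

    def __init__(self, obs_dim, embed_dim, hidden_dim, n_actions, seed=0):
        rng = np.random.default_rng(seed)

        def init(*shape):
            return rng.normal(0.0, 0.1, size=shape)

        # One set of encoder weights shared by both siamese branches.
        self.W_enc = init(embed_dim, obs_dim)
        # Fully-connected layer over the concatenated embeddings.
        self.W_fc = init(hidden_dim, 2 * embed_dim)
        # GRU parameters: update gate z, reset gate r, candidate state n.
        self.W_z, self.U_z = init(hidden_dim, hidden_dim), init(hidden_dim, hidden_dim)
        self.W_r, self.U_r = init(hidden_dim, hidden_dim), init(hidden_dim, hidden_dim)
        self.W_n, self.U_n = init(hidden_dim, hidden_dim), init(hidden_dim, hidden_dim)
        # Linear read-out from the recurrent state to action logits.
        self.W_out = init(n_actions, hidden_dim)
        self.hidden_dim = hidden_dim

    def encode(self, x):
        # The same weights encode the observation and the goal.
        return np.maximum(0.0, self.W_enc @ x)

    def step(self, obs, goal, h):
        joint = np.concatenate([self.encode(obs), self.encode(goal)])
        x = np.maximum(0.0, self.W_fc @ joint)  # last fully-connected layer
        # GRU cell placed right after the fully-connected layer.
        z = sigmoid(self.W_z @ x + self.U_z @ h)
        r = sigmoid(self.W_r @ x + self.U_r @ h)
        n = np.tanh(self.W_n @ x + self.U_n @ (r * h))
        h_new = (1.0 - z) * n + z * h
        return self.W_out @ h_new, h_new

    def rollout(self, observations, goal):
        # The hidden state carries memory of earlier observations.
        h = np.zeros(self.hidden_dim)
        all_logits = []
        for obs in observations:
            logits, h = self.step(obs, goal, h)
            all_logits.append(logits)
        return np.stack(all_logits)


def behavior_cloning_loss(logits, expert_actions):
    # Mean cross-entropy between the policy and the expert's actions.
    shifted = logits - logits.max(axis=1, keepdims=True)
    log_probs = shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))
    return -log_probs[np.arange(len(expert_actions)), expert_actions].mean()


# Usage with made-up data: a 5-step episode, 8-dim observations, 4 actions.
rng = np.random.default_rng(1)
policy = LongSightedPolicy(obs_dim=8, embed_dim=16, hidden_dim=32, n_actions=4)
observations = rng.normal(size=(5, 8))
goal = rng.normal(size=8)
expert_actions = np.array([0, 1, 2, 3, 0])
logits = policy.rollout(observations, goal)
loss = behavior_cloning_loss(logits, expert_actions)
```

Because the hidden state `h` is threaded through `rollout`, the action at each step can depend on the whole observation history rather than on the current frame alone, which is the sense in which recurrency addresses partial observability here.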