Gaussian processes in inverse reinforcement learning

Zhuo-Jun Jin,Hui Qian,Miaoliang Zhu
DOI: https://doi.org/10.1109/ICMLC.2010.5581063
2010-01-01
Abstract:Inverse reinforcement learning (IRL) is the general problem of recovering a reward function from demonstrations provided by an expert. By incorporating Gaussian process (GP) into IRL, we present an approach to recovering both rewards and uncertainty information in continuous state and action spaces. To predicate value in every point in spaces, we use GP models for value function and reward function separately. Our contribution is threefold: First, we extend the existing IRL algorithm to the case of continuous spaces. Second, reward GP provides not only the reward function with flexible forms, but also uncertainty about rewards, which helps the learner make a tradeoff between exploitation and exploration. Third, by introducing the kernel function, our approach takes sample points in the demonstration as learning features. It prevents manually designating features. Experimental results show the proposed method works well and demonstrate good learning in a traditional learning setting.
What problem does this paper attempt to address?