Trajectory Evaluation Method Based on Intention Analysis

JIN Zhuo-jun,QIAN Hui,ZHU Miao-liang
DOI: https://doi.org/10.3785/j.issn.1008-973x.2011.10.006
2011-01-01
Abstract:The trajectory evaluation problem when a demonstration from an expert is available was investigated through inverse reinforcement learning and reward reshaping technique under policy invariance.A novel intention-based method was presented.The weights of the given trajectory and the demonstration were determined with respect to a fixed group of features.The linear subspaces spanned by these two weight vectors were computed by using the reward reshaping technique.The norm of the orthogonal projections was calculated and used to measure the difference between subspaces.In the four-wheel vehicle simulation experiment,the approach was tested by applying it to trajectories generated in several typical scenarios.Empirical results showed that,for the given trajectories,the approach can yield reasonable marks in finite steps according to the difference between the given trajectory and demonstration.The approach can eliminate the ambiguity brought by the inherent ill-posedness of inverse problems,and overcome the difficulties of trajectory evaluation.
What problem does this paper attempt to address?