Learning a Reliable Decision Making Policy for Robust Tracking

Xiaofeng Huang,Kanghao Wang,Haibing Yin,Shengsheng Zheng,Xiang Meng,Shengping Zhang
DOI: https://doi.org/10.1109/VCIP47243.2019.8965745
2019-01-01
Abstract:Recent years deep learning based visual object trackers have achieved state-of-the-art performance on multiple benchmarks. However, most of these trackers lack an effective mechanism to avoid the wrong template update or re-detect the object when unreliable tracking result appears. In this paper, a novel tracking framework consisting of a tracking network for locating the target and a policy network for decision making is proposed. Firstly, during the off-line training phase, a variant of policy gradient algorithm is adopted, which makes the model converge better and faster. Secondly, current response map and history response map are both fed to the policy network to check the reliability of the tracking result, which effectively distinguishes the response diversity. Finally, an efficient redetection module is proposed to filter a large number of searching areas, which greatly improves the speed. Our proposed algorithm is measured on OTB dataset. Assessment results show that our tracking algorithm improves performance by 5%-6% at the expense of only a small amount of speed.
What problem does this paper attempt to address?