Visual Object Tracking Via Guessing and Matching

Ke Song,Wei Zhang,Weizhi Lu,Zheng-Jun Zha,Xiangyang Ji,Yibin Li
DOI: https://doi.org/10.1109/tcsvt.2019.2948600
IF: 5.859
2020-01-01
IEEE Transactions on Circuits and Systems for Video Technology
Abstract:Visual object tracking is a fundamental and time-critical vision task. However, most trackers such as SiamFC and CFNet missed the object movement and simply defined the searching region centered at the location of the target in the previous frame. So they tend to fail in the cases with severe occlusion or a large displacement of the target. In this paper, we consider the object tracking as a dual-task problem of guessing and matching. A guess module is to estimate the motion trend of the target by reinforcement learning based on the observations on appearance changes and motion history. Rather than using the previous location of the target, we may have a more accurate center to locate the searching region. Benefited from such improved searching region, the match module becomes less prone to the object drift problem, and can easily identify the target from the potential distractors in the background. Extensive experimental results on benchmark datasets such as RGBT, OTB-2013, OTB-50 and OTB-100, show that the proposed method achieves leading performance compared to state-of-the-art trackers. Moreover, the proposed tracker could maintain real-time speed, giving itself the potential in practical applications.
What problem does this paper attempt to address?