Joint Learning Appearance and Motion Models for Visual Tracking

Wenmei Xu,Hongyuan Yu,Wei Wang,Chenglong Li,Liang Wang
DOI: https://doi.org/10.1007/978-3-030-88004-0_34
2021-01-01
Abstract:Motion information is a key characteristic in the description of target objects in visual tracking. However, seldom of existing works consider the motion features and tracking performance is thus easily affected when appearance features are not reliable in challenging scenarios. In this work, we propose to leverage motion cues in a novel deep network for visual tracking. In particular, we employ the optical flow to effectively model motion cues and reduce background interferences. With a modest impact on efficiency, both appearance and motion features are used to significantly improve tracking accuracy and robustness. At the same time, we use a few strategies to update our tracker online so that we can avoid error accumulation. Extensive experiments validate that our method achieves better results against state-of-the-art methods on several public datasets, while operating at a real-time speed.
What problem does this paper attempt to address?