Hierarchical Convolutional Features for End-to-end Representation-Based Visual Tracking

Suguo Zhu,Zhenying Fang,Fei Gao
DOI: https://doi.org/10.1007/s00138-018-0947-6
IF: 2.983
2018-01-01
Machine Vision and Applications
Abstract:Recently, deep learning is widely developed in computer vision applications. In this paper, a novel simple tracker with deep learning is proposed to complete the tracking task. A simple fully convolutional Siamese network is applied to capture the similarity between different frames. Nevertheless, the detailed information from lower layers, which is also important for locating the target object, is not considered into the tracking task. In this paper, the detailed information from two lower layers is considered into the response map to improve the performance and not to increase much time spent. This leads more significant improvement for feature representation and localization of the target object. The experimental results demonstrate that the proposed algorithm is efficient and robust compared with the baseline and the state-of-the-art trackers.
What problem does this paper attempt to address?