Accurate and Robust Visual Tracking Using Bounding Box Refinement and Online Sample Filtering.

Yijin Yang,Xiaodong Gu
DOI: https://doi.org/10.1016/j.image.2023.116981
IF: 3.453
2023-01-01
Signal Processing Image Communication
Abstract:Discriminative correlation trackers have currently achieved excellent performance in terms of tracking robustness. However, these trackers still suffer from limited precision of bounding box estimation due to the challenging factors of occlusion, deformation and rotation. In this paper, in order to address these issues, we propose a three-stage tracking framework called BROST. The proposed tracker is mainly composed of DCF module, segmentation module and box refinement module. Firstly, the proposed tracker roughly locates the center position of the object through the DCF module, then utilizes the segmentation module to estimate the scale of the object and finally employs the box refinement module to improve the accuracy of target box estimation. In order to achieve high tracking robustness, we develop a confidence function of correlation response map to filter out the corrupted or occluded training samples of DCF module. Besides, we introduce a new mask initialization network into the segmentation module to make it more suitable for tracking task. The comprehensive experimental results on six challenging visual tracking benchmarks show that the proposed BROST tracker outperforms most of the state-of-the-art trackers and achieves favorable tracking performance on VOT benchmarks.
What problem does this paper attempt to address?