Visual tracking with dumbbell selection network

Tianpeng Liu,Jing Li,Jia Wu,Jun Chang,Yafu Xiao,Yan Hong
DOI: https://doi.org/10.1016/j.neucom.2022.10.031
IF: 6
2023-01-07
Neurocomputing
Abstract:The Siamese network-based trackers aim to train the convolutional neural network offline to match target templates and search regions. Recent researches have managed to adopt deep neural networks to extract sufficient semantic information for the Siamese trackers. However, most of the current methods suffer drastic target appearance variations for 1) failing to encode sufficient localization information of targets and 2) neglecting powerful cross-channel interaction information, thus reducing the learned features’ discriminative and representative ability the tracking accuracy. To remedy these issues, in this article, we propose a Dumbbell Selection Network (DuStNet) by exploring the correlation of the hierarchies of convolutional layers. In concrete, an adaptively Dumbbell Selection mechanism is presented to deal with the targets’ appearance deformation by providing rich semantic and localization information. Furthermore, a CSResNet is developed to improve the residual unit in backbones by strengthening the interdependence between the convolution feature channels. We ingeniously employ the Generalized Intersection over Union (GIoU) to supervise the cross-layer feature-map selection, improving tracking accuracy when utilized as a regression loss simultaneously. Our results suggest that the proposed method is robust to significant appearance variations and can generate more accurate bounding boxes in complicated scenarios. Extensive experimental results on large-scale benchmark datasets prove our method’s effectiveness, which achieves excellent performance on OTB2015, VOT2017, LaSOT, and VOT2019.
computer science, artificial intelligence
What problem does this paper attempt to address?