Siamese Target Estimation Network with AIoU Loss for Real-Time Visual Tracking

Zhiyong Li,Chenming Hu,Ke Nai,Jin Yuan
DOI: https://doi.org/10.1016/j.jvcir.2021.103107
IF: 2.887
2021-01-01
Journal of Visual Communication and Image Representation
Abstract:The fully convolutional siamese network based trackers achieve great progress recently. Most of these methods focus on improving the capability of siamese network to represent the target. In this paper, we propose our model which focuses on estimating the state of the target with our proposed novel IoU (intersection over union) loss function which is named AIoU. Our model consists of a siamese subnetwork for feature extraction and a target estimation subnetwork for state representation. The target estimation subnetwork contains a classification head for classifying background and foreground and a regression head for estimating target. In order to regress better bounding boxes, we further study the loss function utilized in the regression head and propose a powerful IoU loss function. Our tracker achieves competitive performance on OTB2015, VOT2018, and VOT2019 benchmarks with a speed of 180 FPS, which proves the effectiveness of our method.
What problem does this paper attempt to address?