SiamDAG: Siamese Dynamic Receptive Field and Global Context Modeling Network for Visual Tracking.

Qing-hua Sheng,Jian Huang,Zhu Li,Chao-yu Zhou,Hai-bing Yin
DOI: https://doi.org/10.1007/s11042-022-12008-w
IF: 2.577
2022-01-01
Multimedia Tools and Applications
Abstract:Trackers based on anchor-free strategy have achieved a great success in recent years. However, they have limitations. To be specific, receptive fields of their models in each layer are fixed, so that the flexibility is lost. Then, they have no effective modeling of global context. Therefore, our model SiamDAG is put forward in this paper. The core part is Global Context - Selective Kernel block. This part can dynamically adjust its receptive field size based on multiple scales of input information, and model the global context effectively so that the tracker has the global understanding of a visual scene. Meanwhile, the Intersection over Union (IoU) prediction branch linking classification task and regression task is added. Our tracker was evaluated in VOT2019, OTB100 and GOT-10 k benchmark datasets, which achieved good results. It can also run up to 65FPS, far above the real-time requirement.
What problem does this paper attempt to address?