Learning Attentional Recurrent Neural Network for Visual Tracking

Qiurui Wang,Chun Yuan,Jingdong Wang,Wenjun Zeng
DOI: https://doi.org/10.1109/tmm.2018.2869277
IF: 7.3
2018-01-01
IEEE Transactions on Multimedia
Abstract:We propose a novel online Attentional Recurrent Neural Network (ARNN) model for visual tracking, which exploits the feature maps of Convolutional Neural Network (CNN) inside a bounding box to identify whether this target is the one appeared in previous frames. Attention mechanism is adopted for both different parts of targets and different scales of object features. The former attention model is able to select important regions to better trace the target while the latter one learns to weight the multiple scale features for accurate object location. We jointly train the recurrent network with the region based and scale based attention mechanism. The outstanding performances in the experiments validate the effectiveness of our proposed ARNN and show that ARNN outperforms the state-of-the-art tracking methods.
What problem does this paper attempt to address?