Efficient Scalable Spatiotemporal Visual Tracking Based on Recurrent Neural Networks.

Yue Ming, Yashu Zhang
DOI: https://doi.org/10.1007/s11042-019-08331-4
IF: 2.577
2019-01-01
Multimedia Tools and Applications
Abstract: Robust and accurate visual tracking is challenging because targets undergo significant appearance changes caused by scale variation, occlusion, and fast motion. We propose a novel tracking framework, the scalable spatiotemporal visual tracking algorithm (SSVT). First, we construct a Direction Prediction Model (DPM) to predict the spatiotemporal correlation of the target in the next frame, which efficiently narrows the search area and improves the accuracy of spatial localization. Then, an Occlusion Detection Algorithm (ODA) is presented to overcome erroneous updates stemming from the region of interest (ROI) based on the estimated direction and a Kalman filter. Finally, a multi-scale pyramid kernelized correlation filter (MSPKCF) is used during tracking to adaptively adjust to the varying scale of the target and the ROI size. Extensive experiments on the OTB100 and VOT2016 datasets demonstrate that our tracker performs favorably against state-of-the-art trackers while effectively reducing computational redundancy and improving tracking accuracy.
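The abstract states that the ROI relies on an estimated direction and a Kalman filter but gives no formulation. As a rough illustration only, and not the authors' implementation, the sketch below shows how a generic constant-velocity Kalman filter could predict the target center in the next frame and place a padded search window around it. The names AxisKalman and predict_roi, the noise parameters q and r, and the padding factor pad are all hypothetical choices made for this example.

```python
class AxisKalman:
    """1-D constant-velocity Kalman filter (state: position, velocity).

    Illustrative assumption: one independent filter per image axis,
    with diagonal process noise q and measurement noise r.
    """

    def __init__(self, pos, q=1e-2, r=1.0):
        self.p, self.v = float(pos), 0.0      # position and velocity estimates
        self.P = [[1.0, 0.0], [0.0, 1.0]]     # state covariance
        self.q, self.r = q, r

    def predict(self, dt=1.0):
        # x' = F x and P' = F P F^T + Q, with F = [[1, dt], [0, 1]]
        (a, b), (c, d) = self.P
        self.p += dt * self.v
        self.P = [[a + dt * (b + c) + dt * dt * d + self.q, b + dt * d],
                  [c + dt * d, d + self.q]]
        return self.p

    def update(self, z):
        # Measurement model H = [1, 0]: only the position is observed.
        (a, b), (c, d) = self.P
        s = a + self.r                        # innovation covariance
        k0, k1 = a / s, c / s                 # Kalman gain
        y = z - self.p                        # innovation
        self.p += k0 * y
        self.v += k1 * y
        self.P = [[(1 - k0) * a, (1 - k0) * b],
                  [c - k1 * a, d - k1 * b]]


def predict_roi(kx, ky, size, pad=1.5):
    """Return (x, y, w, h) of a search window centered on the predicted position."""
    cx, cy = kx.predict(), ky.predict()
    w, h = size[0] * pad, size[1] * pad
    return (cx - w / 2, cy - h / 2, w, h)
```

In a tracking loop under these assumptions, each frame would call predict_roi to choose where to search, run the correlation filter inside that window, and feed the detected center back through update on both axes. A large gap between the prediction and the detection could serve as one possible occlusion cue, though the abstract does not say how ODA makes that decision.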