Multi-correlation filters method for robust visual tracking
Qianru Chen, Risheng Liu, Xin Fan, Haojie Li
DOI: https://doi.org/10.11834/jig.170387
2018-01-01
Journal of Image and Graphics
Abstract: Objective Visual object tracking remains a challenging task because of target pose variation, occlusion, and background clutter in complex scenes. Recently, discriminative correlation filter methods have been applied widely and successfully to the visual tracking problem. The standard correlation filter method obtains a large number of training samples by cyclic shifts and solves for the filters with the fast Fourier transform, which gives it good real-time performance and robustness. However, the negative training samples produced by shifts across the patch boundary degrade tracking performance. Trackers based on spatially regularized correlation filters strengthen the influence of the target area by introducing a spatial weight function, which makes the difference between positive and negative samples more pronounced. This enlarges the target search area, but it also increases computation time. In addition, in complex scenes where the target deforms irregularly or the background resembles the target, the background components of the filters are also enhanced, which leads to tracking failure.
Method To address these problems, this paper proposes a method that adaptively fuses multiple correlation filters. The unconstrained correlation filter tracking problem is transformed into two constrained subproblems via the alternating direction method of multipliers (ADMM), and the two subproblems are solved by different correlation filter methods. First, standard correlation filters locate the target coarsely. The target is then relocated with spatially regularized correlation filters, which adjust the target position to improve tracking performance.
Result In the experiments, the algorithm is evaluated on the 100 videos of the OTB-2015 benchmark dataset and compared with other state-of-the-art trackers, using the center position error and the overlap rate of the target bounding box as evaluation criteria. The algorithm handles variation in position, scale, and occlusion, and it achieves the best results on CarScale, Freeman4, Girl, and other videos. Over the 100 videos, the average center position error is 28.55 pixels and the average overlap rate of the target bounding box is 61%. Our algorithm outperforms the compared methods that use hand-crafted features. Compared with the correlation filter method that uses deep features such as CNN features, the average center position error of our algorithm is 6 pixels higher, but its average overlap rate of the target bounding box improves by 4%.
Conclusion Extensive experimental results show that our algorithm achieves better accuracy and robustness under appearance changes such as variation in position and scale.
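The standard correlation filter step described in the abstract (training on cyclic shifts and solving with the fast Fourier transform) can be illustrated with a minimal sketch. It assumes a single-channel patch, a linear ridge-regression filter, and a Gaussian desired response; the function names, the regularization weight `lam`, and the synthetic random patch are hypothetical choices for illustration and are not taken from the paper. It does not include the spatial regularization or the ADMM-based fusion of the proposed method.

```python
import numpy as np


def gaussian_response(shape, sigma=2.0):
    """Desired response: a 2-D Gaussian peaked at the patch centre."""
    h, w = shape
    ys, xs = np.mgrid[0:h, 0:w]
    cy, cx = h // 2, w // 2
    return np.exp(-((ys - cy) ** 2 + (xs - cx) ** 2) / (2.0 * sigma ** 2))


def train_filter(patch, target, lam=1e-2):
    """Closed-form ridge regression over all cyclic shifts of `patch`.

    Because the shifted samples form a circulant matrix, the solution is
    an element-wise division in the Fourier domain.
    `lam` is an assumed regularization weight.
    """
    X = np.fft.fft2(patch)
    Y = np.fft.fft2(target)
    return Y * np.conj(X) / (X * np.conj(X) + lam)


def detect(filter_hat, patch):
    """Correlate the learned filter with a new patch and locate the peak."""
    Z = np.fft.fft2(patch)
    response = np.real(np.fft.ifft2(filter_hat * Z))
    peak = np.unravel_index(np.argmax(response), response.shape)
    return response, peak


def demo():
    """Train on a synthetic patch, then recover a known cyclic shift."""
    rng = np.random.default_rng(0)
    patch = rng.standard_normal((32, 32))      # stand-in for image features
    target = gaussian_response(patch.shape)
    filter_hat = train_filter(patch, target)

    # Simulate target motion: 3 pixels down, 5 pixels left (cyclic).
    moved = np.roll(patch, shift=(3, -5), axis=(0, 1))
    _, peak = detect(filter_hat, moved)

    centre = (patch.shape[0] // 2, patch.shape[1] // 2)
    return int(peak[0] - centre[0]), int(peak[1] - centre[1])


if __name__ == "__main__":
    print("estimated shift (rows, cols):", demo())
```

The displacement of the response peak from the patch centre gives the estimated target motion. In this toy setting the shift is exactly cyclic, so no boundary effects appear; on real frames the wrapped-around samples are the unrealistic negative samples that the abstract identifies as the motivation for spatial regularization.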