Deep Metric Learning for Visual Tracking
Hu, J.,Lu, J.,Tan, Yap-Peng
DOI: https://doi.org/10.1109/TCSVT.2015.2477936
2016-01-01
Abstract:In this paper, we propose a deep metric learning (DML) approach for robust visual tracking under the particle filter framework. Unlike most existing appearance-based visual trackers which use hand-crafted similarity metrics, our DML tracker learns a nonlinear distance metric to classify target object and background regions using a feed-forward neural network architecture. Since there are usually large variations in visual objects caused by varying deformations, illuminations, occlusions, motions, rotations, scales, and cluttered backgrounds, conventional linear similarity metrics cannot work well in such scenarios. To address this, our proposed DML tracker first learns a set of hierarchical nonlinear transformations in the feed-forward neural network to project both the template and particles into the same feature space where the intra-class variations of positive training pairs are minimized and the interclass variations of negative training pairs are maximized, simultaneously. Then, the candidate which is most similar to the template in the learned deep network is identified as the true target. Experiments on the benchmark dataset including 51 challenging videos show that our DML tracker achieves very competitive performance with the state-of-the-art trackers.