Enhancing Discriminative Appearance Model for Visual Tracking.

Xuedong He,Calvin Yu-Chian Chen
DOI: https://doi.org/10.1016/j.eswa.2023.119670
IF: 8.5
2023-01-01
Expert Systems with Applications
Abstract:Most end-to-end discriminative trackers obtain the improvements by a large margin through mining more representative target features for localizing the target, yet it isn't trivial to get comprehensive features as a result of diverse challenges. Moreover, we note that prevalent discriminative trackers merely use a convolution block to process pre-trained residual features to acquire target classification features, which further is input into the appearance model to obtain a target classifier, aiming at obtaining a response map. In view of this problem, we propose a novel feature enhancement module to obtain a richer and comprehensive feature representation, which is end-to-end trainable. Furthermore, discriminative trackers equipped with an online update mechanism demand to refine the classification model with recent samples, which will more or less learn the inaccurate tracking results into the model, thus weakening its discriminative ability. To alleviate this issue, we use a metric learning method to devise a verifier, which is devoted to yielding a similarity score to make piecewise predictions. By comparing the similarity score and piecewise thresholds, this strategy can enhance the robustness of the tracker by adaptively regulating the update of the sample memory and the learning rate. It is worth mentioning that the verifier module has strong portability. We choose the recent SuperDiMP as a baseline and implement comprehensive experimental tests and analyses of our approach on six popular benchmarks, which confirm that the proposed methods perform favorably against state-of-the-art trackers.
What problem does this paper attempt to address?