Abstract:Recently, deep learning achieves competitive accuracy and robustness and dramatically improves the performance of target scale estimation through pre-trained special network branches. Yet, a fast and robust scale estimation method is still a challenging problem for visual object tracking. Early correlation filter tracking algorithm uses a multiscale search method to estimate the scale with the constant number of scale factors and invariant aspect ratio, which is redundant for the video frames with little or no scale change. Also, an independent network branch for target scale state is proposed, but the training network needs an abundance of datasets, and the effect is not very stable for the unseen target object. Aiming at the problems of existing scale estimation solutions, several variable scale learning methods are proposed to explore the scale change of the target. Firstly, we proposed a variable scale factor learning method, which makes us rid of the commonly used multiscale search with the flaws of fixed scale factors. Secondly, we used a multiscale aspect ratio solution to make up for invariant aspect ratio. Thirdly, the first and second scale methods were combined to propose a variable scale aspect ratio estimation method. Finally, the proposed scale estimation methods were embedded into the state-of-the-art ECO (Efficient Convolution Operators) and ATOM (Accurate Tracking by Overlap Maximization) trackers to replace the original scale methods for verifying the effectiveness of our proposed method. Extensive experiments on OTB100, UAV123, TC128 and LaSOT datasets demonstrate that the tracking performance can be improved effectively by using the proposed scale methods.

Tracking based on scale-estimated deep networks with hierarchical correlation ensembling for cross-media understanding

Robust Object Tracking with a Hierarchical Ensemble Framework

Collaborative Correlation Filters for Real-Time Tracking with Spatial Constraint.

Exploiting multi-scale hierarchical feature representation for visual tracking

Variable Scale Learning for Visual Object Tracking

Real-Time Scale Adaptive Visual Tracking with Context Information and Correlation Filters

Deep Scale Feature For Visual Tracking

Scale-Aware Tracking Method with Appearance Feature Filtering and Inter-Frame Continuity

Robust Visual Tracking Based on Scale Invariance and Deep Learning

Learning temporal context for correlation tracking with scale estimation

Homography Decomposition Networks for Planar Object Tracking

A Scale Adaptive Kernel Correlation Filter Tracker With Feature Integration

Robust and real-time deep tracking via multi-scale domain adaptation

Multi-hierarchical Independent Correlation Filters for Visual Tracking

Efficient Scale Estimation Methods using Lightweight Deep Convolutional Neural Networks for Visual Tracking

Visual object tracking based on residual network and cascaded correlation filters

Ensemble Tracking Based on Diverse Collaborative Framework With Multi-Cue Dynamic Fusion

Scale-Adaptive Tracking Method by Combining Kernelized Correlation Filter with Geometric Estimation

Deep Correlation Filter Tracking With Shepherded Instance-Aware Proposals

Robust Visual Tracking Based on Hierarchical Appearance Model.

Ensemble Tracking Based on CNN