Abstract:Recently, deep learning has achieved great success in visual tracking. The goal of this paper is to review the state-of-the-art tracking methods based on deep learning. First, we introduce the background of deep visual tracking, including the fundamental concepts of visual tracking and related deep learning algorithms. Second, we categorize the existing deep-learning-based trackers into three classes according to network structure, network function and network training. For each categorize, we explain its analysis of the network perspective and analyze papers in different categories. Then, we conduct extensive experiments to compare the representative methods on the popular OTB-100, TC-128 and VOT2015 benchmarks. Based on our observations, we conclude that: (1) The usage of the convolutional neural network (CNN) model could significantly improve the tracking performance. (2) The trackers using the convolutional neural network (CNN) model to distinguish the tracked object from its surrounding background could get more accurate results, while using the CNN model for template matching is usually faster. (3) The trackers with deep features perform much better than those with low-level hand-crafted features. (4) Deep features from different convolutional layers have different characteristics and the effective combination of them usually results in a more robust tracker. (5) The deep visual trackers using end-to-end networks usually perform better than the trackers merely using feature extraction networks. (6) For visual tracking, the most suitable network training method is to per-train networks with video information and online fine-tune them with subsequent observations. Finally, we summarize our manuscript and highlight our insights, and point out the further trends for deep visual tracking.

Deep Learning for Visual Tracking: A Comprehensive Survey

Deep Learning in Visual Tracking: A Review

Deep Learning Based Visual Tracking: A Review

Advances in Deep Learning Methods for Visual Tracking: Literature Review and Fundamentals

Deep visual tracking: Review and experimental comparison

Visual object tracking: A survey

Single Object Tracking Research: A Survey

Single Object Tracking Research:A Survey

Study on Deep Learning and Its Application in Visual Tracking.

A Survey of Visual Tracking

Deep learning in multi-object detection and tracking: state of the art

Deep learning for multiple object tracking: a survey

Deep Learning-based Visual Multiple Object Tracking:A Review

Deep Learning for Unmanned Aerial Vehicle-Based Object Detection and Tracking: A Survey

Deep Learning for UAV-based Object Detection and Tracking: A Survey

Deep Learning-Based Object Tracking in Satellite Videos: A Comprehensive Survey with a New Dataset

Visual Object Tracking with Discriminative Filters and Siamese Networks: A Survey and Outlook

A Review of Target Tracking Based on Deep Learning

Review on Video Object Tracking Based on Deep Learning

Survey of Single-Target Visual Tracking Methods Based on Online Learning.

A Review of Deep Learning-Based Visual Multi-Object Tracking Algorithms for Autonomous Driving