Abstract:Object tracking is one of the most important components in numerous applications of computer vision. Remote sensing videos provided by commercial satellites make it possible to extend this topic into the earth observation domain. In satellite videos, typical moving targets like vehicles and planes only cover a small area of pixels, and they could easily be confused with surrounding complex ground scenes. Similar objects nearby in satellite videos can hardly be differed by appearance details due to the resolution constraint. Thus, tracking drift caused by distractions is also a thorny problem. Facing challenges, traditional tracking methods such as correlation filters with hand-crafted visual features achieve unsatisfactory results in satellite videos. Methods based on deep neural networks have demonstrated their superiority in various ordinary visual tracking benchmarks, but their results on satellite videos remain unexplored. In this article, deep learning technologies are applied to object tracking in satellite videos for better performance. A simple regression network is used to combine a regression model with convolutional layers and a gradient descent algorithm. The regression network fully exploits the abundant background context to learn a robust tracker. Instead of handcrafted features, both appearance features and motion features, which are extracted by pretrained deep neural networks, are used for accurate object tracking. In cases when the tracker encounters ambiguous appearance information, the motion features could provide complementary and discriminative information to improve tracking performances. Experimental results on various satellite videos show that the proposed method achieves better tracking performance than other state-of-the-arts.

Learning Deconvolutional Network for Object Tracking

Hierarchical Convolutional Features for Visual Tracking

Deep Convolutional Correlation Filter Learning Toward Robust Visual Object Tracking

Visual object tracking based on residual network and cascaded correlation filters

Robust Visual Tracking Via Hierarchical Convolutional Features

CFNN: Correlation Filter Neural Network for Visual Object Tracking

Learning Fully Convolutional Network for Visual Tracking with Multi-Layer Feature Fusion

Robust Visual Tracking via Convolutional Networks

Learning Spatio-Temporal Convolutional Network for Real-Time Object Tracking

Visual Tracking with Fully Convolutional Networks

Deformable Object Tracking With Gated Fusion

Convolutional Neural Networks Based Scale-Adaptive Kernelized Correlation Filter For Robust Visual Object Tracking

Target-Aware Deep Tracking

Convolutional Features Combining SL(3) Group for Visual Tracking.

A Twofold Convolutional Regression Tracking Network with Temporal and Spatial Mechanism

Learning Channel-Aware Deep Regression for Object Tracking

Taking Full Advantage of Convolutional Network for Robust Visual Tracking

Deep Deblurring Correlation Filter for Object Tracking

Object Tracking in Satellite Videos Based on Convolutional Regression Network with Appearance and Motion Features

Visual Tracking with Kernelized Correlation Filters Based on Multiple Features

Robust Object Tracking Based On Temporal And Spatial Deep Networks