Online Depth Image-Based Object Tracking with Sparse Representation and Object Detection

ZhengWei-Long,ShenShan-Chun,LuBao-Liang

IF: 2.565

2017-01-01

Neural Processing Letters

Abstract:Online object tracking under complex environments is an important but challenging problem in computer vision, especially for illumination changing and occlusion conditions. With the emergence of co...

What problem does this paper attempt to address?

Online Depth Image-Based Object Tracking with Sparse Representation and Object Detection

Shan-Chun Shen,Wei-Long Zheng,Bao-Liang Lu

DOI: https://doi.org/10.1007/s11063-016-9509-y

2014-01-01

Abstract:Online object tracking under complex environments is an important but challenging problem in computer vision, especially for illumination changing and occlusion conditions. With the emergence of commercial real-time depth cameras like Kinect, depth image-based object tracking, which is insensitive to illumination changing, gains more and more attentions. In this paper, we propose an online depth image-based object tracking method with sparse representation and object detection. In this framework, we combine tracking and detection to leverage precision and efficiency under heavy occlusion conditions. For tracking, objects are represented by sparse representations learned online with update. For detection, we apply two different strategies based on tracking-learning-detection and wider search window approaches. We evaluate our methods on both the subset of the public dataset Princeton Tracking Benchmark and our own driver face video in a simulated driving environment. The quantitative evaluations of precision and running time on these two datasets demonstrate the effectiveness and efficiency of our proposed object tracking algorithms.
Object tracking with 3D LIDAR via multi-task sparse learning

Shiyang Song,Zhiyu Xiang,Jilin Liu

DOI: https://doi.org/10.1109/icma.2015.7237897

2015-01-01

Abstract:Moving object tracking is a fundamental task for autonomous vehicles operating in urban areas. In this paper, a novel sparse learning based object tracking algorithm utilizing 3D LIDAR data is proposed. The 3D point clouds acquired from HDL-64E 3D LIDAR are first resampled on a virtual image plane, where the hypothesis of the targets is generated under the particle filtering framework. Four complementary features, i.e., normal orientation, depth, LBP and HOG, are extracted on each particle to describe the appearance of the candidates. Then a multi-task multi-cue sparse learning algorithm is employed to select the best candidate and realize the tracking of the object. To improve the robustness of the algorithm, the sparse learning framework is further enhanced by a specifically designed background filtering and occlusion detection mechanism. The experiments carried out on KITTI benchmark show promising object tracking performance, especially when handling complex tracking situations such as occlusion and posture change.
Detecting and Tracking Dynamic Objects in Complex Environments

LX Zhou,JL Liu,WK Gu

DOI: https://doi.org/10.1117/12.323681

1998-01-01

Abstract:This paper presents a robust algorithm for detecting and tracking multiple targets in a long video sequence. What differentiate it from previous approaches are the complex environments, the unconstrained camera motion, as well as the comparatively short distance between the sensor and the targets. Firstly the background motion is estimated by a global LMedS algorithm, then on the compensated difference image a graph-like stochastic procedure is applied for tracking multiple moving objects. Real video experiments show its efficiency.
Exploit Spatiotemporal Contextual Information for 3D Single Object Tracking Via Memory Networks

Jongwon Ra,MengMeng Wang,Jianbiao Mei,Shanqi Liu,Yu Yang,Yong Liu

DOI: https://doi.org/10.1109/3dv62453.2024.00050

2024-01-01

Abstract:The point cloud-based 3D single object tracking plays an indispensable role in autonomous driving. However, the application of 3D object tracking in the real world is still challenging due to the inherent sparsity and self-occlusion of point cloud data. Therefore, it is necessary to exploit as much useful information from limited data as we can. Since 3D object tracking is a video-level task, the appearance of objects changes gradually over time, and there is rich spatiotemporal contextual information among historical frames. However, existing methods do not fully utilize this information. To address this, we propose a new method called SCTrack, which utilizes a memory-based paradigm to exploit spatiotemporal contextual information. SCTrack incorporates both long-term and short-term memory banks to store the spatiotemporal features of targets from historical frames. By doing so, the tracker can benefit from the entire video sequence and make more informed predictions. Additionally, SCTrack extracts the mask prior to augmenting the target representation, improving the target-background discriminability. Extensive experiments on KITTI, nuScenes, and Waymo Open datasets verify the effectiveness of our proposed method.
Visual Tracking via Sparse Representation and Online Dictionary Learning.

Xu Cheng,Nijun Li,Tongchi Zhou,Lin Zhou,Zhenyang Wu

DOI: https://doi.org/10.1007/978-3-319-13323-2_8

2014-01-01

Abstract:Sparse representation has been shown competitive performance on single object tracking. In this paper, we extend this technique to tracking multiple interactive objects and present a novel sparse tracker under the tracking-by-detection framework, with saliency detector for objects detection and sparse representation for objects association. Furthermore, we propose an online dictionary learning scheme to capture appearance variations of objects. To avoid using trivial templates, the dictionary contains not only objects templates, but also background information, resulting in more robust estimation. The experiments demonstrate that our approach achieves favorable performance over state-of-the-art algorithms.
Online discriminative object tracking with local sparse representation

Qing Wang, Feng Chen, Wenli Xu,Ming-Hsuan Yang

DOI: https://doi.org/10.1109/WACV.2012.6162999

2012-01-01

Abstract:We propose an online algorithm based on local sparse representation for robust object tracking. Local image patches of a target object are represented by their sparse codes with an over-complete dictionary constructed online, and a classifier is learned to discriminate the target from the background. To alleviate the visual drift problem often encountered in object tracking, a two-stage algorithm is proposed to exploit both the ground truth information of the first frame and observations obtained online. Different from recent discriminative tracking methods that use a pool of features or a set of boosted classifiers, the proposed algorithm learns sparse codes and a linear classifier directly from raw image patches. In contrast to recent sparse representation based tracking methods which encode holistic object appearance within a generative framework, the proposed algorithm employs a discrimination formulation which facilitates the tracking task in complex environments. Experiments on challenging sequences with evaluation of the state-of-the-art methods show effectiveness of the proposed algorithm.
Visual Tracking Based on Online Sparse Feature Learning.

Zelun Wang,Jinjun Wang,Shun Zhang,Yihong Gong

DOI: https://doi.org/10.1016/j.imavis.2015.04.005

IF: 3.86

2015-01-01

Image and Vision Computing

Abstract:Various visual tracking approaches have been proposed for robust target tracking, among which using sparse representation of the tracking target yields promising performance. Some earlier works in this line used a fixed subset of features to compress the target's appearance, which has limited modeling capacity between the target and the background, and could not accommodate their appearance change over long period of time. In this paper, we propose a visual tracking method by modeling targets with online-learned sparse features. We first extract high dimensional Haar-like features as an over-completed basis set, and then solve the feature selection problem in an efficient L1-regularized sparse-coding process. The selected low-dimensional representation best discriminates the target from its neighboring background. Next we use a naive Bayesian classifier to select the most-likely target candidate by a binary classification process. The online feature selection process happens when there are significant appearance changes identified by a thresholding strategy. In this way, our proposed method could work for long tracking tasks. At the same time, our comprehensive experimental evaluation has shown that the proposed methods achieve excellent running speed and higher accuracy over many state-of-the-art approaches.
SparseTrack: Multi-Object Tracking by Performing Scene Decomposition based on Pseudo-Depth

Zelin Liu,Xinggang Wang,Cheng Wang,Wenyu Liu,Xiang Bai

DOI: https://doi.org/10.48550/arXiv.2306.05238

2023-06-08

Computer Vision and Pattern Recognition

Abstract:Exploring robust and efficient association methods has always been an important issue in multiple-object tracking (MOT). Although existing tracking methods have achieved impressive performance, congestion and frequent occlusions still pose challenging problems in multi-object tracking. We reveal that performing sparse decomposition on dense scenes is a crucial step to enhance the performance of associating occluded targets. To this end, we propose a pseudo-depth estimation method for obtaining the relative depth of targets from 2D images. Secondly, we design a depth cascading matching (DCM) algorithm, which can use the obtained depth information to convert a dense target set into multiple sparse target subsets and perform data association on these sparse target subsets in order from near to far. By integrating the pseudo-depth method and the DCM strategy into the data association process, we propose a new tracker, called SparseTrack. SparseTrack provides a new perspective for solving the challenging crowded scene MOT problem. Only using IoU matching, SparseTrack achieves comparable performance with the state-of-the-art (SOTA) methods on the MOT17 and MOT20 benchmarks. Code and models are publicly available at \url{https://github.com/hustvl/SparseTrack}.
Effective and Robust Object Tracking in Constrained Environments

Junda Zhu,Yuanwei Lao,Yuan F. Zheng

DOI: https://doi.org/10.1109/icassp.2008.4517768

2008-01-01

Abstract:We present a new scheme for efficient and robust video object tracking in constrained environments. It finds its application in security surveillance, traffic monitoring, etc. In these applications movements of objects are restricted by the environments; therefore, environment constraints can be exploited as heuristic information for improving the performance of tracking. In this paper we use the distance field to represent environment constraints and integrate it into the framework of particle filtering. Experiments on some video surveillance sequences demonstrate the effectiveness of our approach.
Online Learning of Multi-Feature Weights for Robust Object Tracking

Tao Zhou,Harish Bhaskar,Kai Xie,Jie Yang,Xiangjian He,Pengfei Shi

DOI: https://doi.org/10.1109/icip.2015.7350894

2015-01-01

Abstract:Sparse Representation based Classification (SRC) and its potential in object tracking have been explored in recent years. However, the trade-off between the discriminative ability of the overly emphasized sparse representation and the lack of insight on correlation of visual information has raised questions over the general applicability of such methods in object tracking. In addition, the need for the optimization of a series of l(1)-regularized least square norm, increases the computational complexity thereby limiting their usage in real-time applications. In this paper, a novel approach to robust object tracking is proposed. First, the variations in the appearance of the tracked target is modelled using PCA basis vectors, and further, a l(2)-regularized least square method is used to solve the proposed representation model. In order to improve the robustness of feature representation in object tracking applications, weights are associated with multiple trackers; each formulated using a different feature, and adapted via an online learning scheme. Finally, a decision fusion criterion is imposed to generate an optimized output through the weighted combination of different tracking results. Experiments on challenging video sequences have demonstrated the superior accuracy and robustness of the proposed method in comparison to thirteen other state-of-the-art baselines.
Tracking in Multimedia Data Via Robust Reweighted Local Multi-Task Sparse Representation for Transportation Surveillance

Jiping Xiong,Qinghua Tang,Xiaowei He,Lisang Cai,Fei Wang

DOI: https://doi.org/10.1007/s11042-016-3464-5

IF: 2.577

2016-01-01

Multimedia Tools and Applications

Abstract:It is of great importance in smart transportation surveillance to track object reliably from multimedia streaming data. Sparse representation based target tracking methods often suffer from tracking failure when target is under occlusions, pose changes or illumination changes conditions. In this paper, we propose a novel robust reweighted local multi-task sparse tracking algorithm. In the algorithm, local patches of all candidate targets are represented as a linear combination of the corresponding local patches from the template dictionary. Furthermore, in order to efficiently capture the frequently emerging outlier tasks, we decompose the sparse coefficient matrix to two collaborative matrices to make sure that the same type of particles share the same sparse structure. Observing that the edge of the candidate object contains background information, this paper gives a lower weight coefficient to the reconstruction error regularization located in the edge of the local patches than the middle local patches. Experimental evaluations on challenging sequences demonstrate the effectiveness, accuracy and robustness of our proposed algorithm in comparison with state-of-the-art algorithms.
On Combining Compressed Sensing and Sparse Representations for Object Tracking.

Hang Sun,Jing Li,Bo Du,Dacheng Tao

DOI: https://doi.org/10.1007/978-3-319-48890-5_4

2016-01-01

Abstract:The tracking algorithm of compressed sensing takes advantage of the objective's background information, but lacks the feedback mechanism towards the results. The 11 sparse tracking algorithm adapts to the changes in the objectives' appearances but at the cost of losing their background information. To enhance the effectiveness and robustness of the algorithm in coping with such distractions as occlusion and illumination variation, this paper proposes a tracking framework with the 11 sparse representation being the detector and compressed sensing algorithm the tracker, and establishes a complementary classifier model. A second-order model updating strategy has therefore been proposed to preserve the most representative templates in the 11 sparse representations. It is concluded that this tracking algorithm is better than the prevalent 8 ones with a respective precision plot of 77.15ï¾¿%, 72.33ï¾¿% and 81.13ï¾¿% and a respective success plot of 77.67ï¾¿%, 74.01ï¾¿%, 81.51ï¾¿% in terms of the overall, occlusion and illumination variation.
Robust visual tracking based on online learning sparse representation

Shengping Zhang,Hongxun Yao,Huiyu Zhou,Xin Sun,Shaohui Liu

DOI: https://doi.org/10.1016/j.neucom.2011.11.031

IF: 6

2013-01-01

Neurocomputing

Abstract:Handling appearance variations is a very challenging problem for visual tracking. Existing methods usually solve this problem by relying on an effective appearance model with two features: (1) being capable of discriminating the tracked target from its background, (2) being robust to the target's appearance variations during tracking. Instead of integrating the two requirements into the appearance model, in this paper, we propose a tracking method that deals with these problems separately based on sparse representation in a particle filter framework. Each target candidate defined by a particle is linearly represented by the target and background templates with an additive representation error. Discriminating the target from its background is achieved by activating the target templates or the background templates in the linear system in a competitive manner. The target's appearance variations are directly modeled as the representation error. An online algorithm is used to learn the basis functions that sparsely span the representation error. The linear system is solved via @?"1 minimization. The candidate with the smallest reconstruction error using the target templates is selected as the tracking result. We test the proposed approach using four sequences with heavy occlusions, large pose variations, drastic illumination changes and low foreground-background contrast. The proposed approach shows excellent performance in comparison with two latest state-of-the-art trackers.
Object Tracking Algorithm under Occlusion Based on Sparse Representation

GAO Lin,FAN Yong,CHEN Nian-nian,LI Yu-feng,LI Hui-zhuo,ZHANG Jin-feng

DOI: https://doi.org/10.3969/j.issn.1000-3428.2012.15.002

2012-01-01

Abstract:A novel visual tracking algorithm based on sparse representation is proposed to solve the problem of occlusion.The tracked object is described using the sparse representation method,and the image Gabor-features are used to construct the object dictionary and occlusion dictionary.The optimal sparse coding coefficients are obtained via l1-norm minimization.The tracking algorithm is designed in a particle filtering framework.The occlusion is judged according to the distribution of nonzero values in sparse coding coefficients.Under occlusion,the particles’ weight is calculated based on the approximation residual of observation by sparse representation.A template reliability evaluation method is introduced to suppress the drift during the object dictionary update.Experimental results show that the proposed algorithm can handle occlusion efficiently,and be robust to pose and illumination variations.
Real-Time Online Multi-Object Tracking

Mengyun Yi,Sheng Zhang,He Xu

DOI: https://doi.org/10.1145/3374587.3374628

2019-01-01

Abstract:In recent years, object detection technology has been continuously developed, and the tracking-by-detection strategy has gradually become the main method of multi-object tracking. Based on detection, the accuracy of the multi-object tracking depends on the detection results to a large extent. However, in many practical applications, especially the case of complex scenes and crowded objects, the detection results are usually inaccurate. In this paper, a joint detection and tracking framework is proposed with a unified confidence scoring function to evaluate tracks confidence and complement low confidence detections with high confidence tracks. In this way, detections and tracks can be combined organically and achieved complementarity. High confidence detection results can prevent long-term tracking drift, while high confidence tracking prediction can deal with false detection and missed detection caused by occlusion during object interaction. Moreover, we trained the ReID appearance feature with higher identification capabilities on the large-scale person re-identification datasets, which has higher identification capability. Extensive experiments are conducted on MOT17 benchmarks to demonstrate the real-time and advanced performance of our tracker.
Online Multiple Object Tracking Via Exchanging Object Context.

Hongyang Yu,Lei Qin,Qingming Huang,Hongxun Yao

DOI: https://doi.org/10.1016/j.neucom.2018.02.068

IF: 6

2018-01-01

Neurocomputing

Abstract:Multiple object tracking is a key problem for many computer vision applications such as video surveillance, advanced driver assistance or animation. Most of existing tracking-by-detection methods are mainly based on object appearances and motions. However, the contextual information around the target has not been fully exploited. In this paper, we pay more attention to the contextual information and propose an Exchanging Object Context (EOC) model, which takes full advantage of the context information. Specifically, we implement an efficient and accurate online multiple object tracking algorithm with a novel affinity measure to associate detections. This measure calculates the similarity between targets and detections with the background smoothness after exchanging the contexts between detections and targets, using a novel color histogram descriptor. We refine the bounding boxes by measuring the context changes. Extensive experimental results on two public benchmarks demonstrate the effectiveness of the proposed tracking method with comparisons to several state-of-the-art trackers.
Object Tracking by Occlusion Detection via Structured Sparse Learning

Tianzhu Zhang,Bernard Ghanem,Changsheng Xu,Narendra Ahuja

DOI: https://doi.org/10.1109/CVPRW.2013.150

2013-01-01

Abstract:Sparse representation based methods have recently drawn much attention in visual tracking due to good performance against illumination variation and occlusion. They assume the errors caused by image variations can be modeled as pixel-wise sparse. However, in many practical scenarios these errors are not truly pixel-wise sparse but rather sparsely distributed in a structured way. In fact, pixels in error constitute contiguous regions within the object's track. This is the case when significant occlusion occurs. To accommodate for non-sparse occlusion in a given frame, we assume that occlusion detected in previous frames can be propagated to the current one. This propagated information determines which pixels will contribute to the sparse representation of the current track. In other words, pixels that were detected as part of an occlusion in the previous frame will be removed from the target representation process. As such, this paper proposes a novel tracking algorithm that models and detects occlusion through structured sparse learning. We test our tracker on challenging benchmark sequences, such as sports videos, which involve heavy occlusion, drastic illumination changes, and large pose variations. Experimental results show that our tracker consistently outperforms the state-of-the-art.
Occlusion-Aware Real-Time Object Tracking

Xingping Dong,Jianbing Shen,Dajiang Yu,Wenguan Wang,Jianhong Liu,Hua Huang

DOI: https://doi.org/10.1109/tmm.2016.2631884

IF: 7.3

2017-01-01

IEEE Transactions on Multimedia

Abstract:The online learning methods are popular for visual tracking because of their robust performance for most video sequences. However, the drifting problem caused by noisy updates is still a challenge for most highly adaptive online classifiers. In visual tracking, target object appearance variation, such as deformation and long-term occlusion, easily causes noisy updates. To overcome this problem, a new real-time occlusion-aware visual tracking algorithm is introduced. First, we learn a novel two-stage classifier with circulant structure with kernel, named integrated circulant structure kernels (ICSK). The first stage is applied for transition estimation and the second is used for scale estimation. The circulant structure makes our algorithm realize fast learning and detection. Then, the ICSK is used to detect the target without occlusion and build a classifier pool to save these classifiers with noisy updates. When the target is in heavy occlusion or after longterm occlusion, we redetect it using an optimal classifier selected from the classifier-pool according to an entropy minimization criterion. Extensive experimental results on the full benchmark demonstrate our real-time algorithm achieves better performance than state-of-the-art methods.
Online Feature Extraction and Selection for Object Tracking

Wei He,Xiaolin Zhao,Li Zhang

DOI: https://doi.org/10.1109/icma.2007.4304126

2007-01-01

Abstract:Object tracking is a challenging problem in realtime computer vision, especially when the circumstance is unstable due to variations of lighting, pose, and view-point. This paper presents an online feature selection mechanism by extracting and evaluating multiple color features. Given a tracking image, we use clustering method to segment the object according to different color, and generate Gaussian model for each segment respectively to extract the color feature. Then we judge the discrimination of the features and select an appropriate feature subset, by which the object can be distinguished from the background at the highest SNR(signal noise ratio). This feature selection mechanism is embedded in a mean-shift tracking system that updating the feature set adaptively. Examples are presented to show that our method is robust to complicated object and changing background.
Tracking Multiple Objects Through Occlusion with Online Sampling and Position Estimation

Lin Zhu,Jie Zhou,Jingyan Song

DOI: https://doi.org/10.1016/j.patcog.2008.01.014

IF: 8

2008-01-01

Pattern Recognition

Abstract:To track multiple objects through occlusion, either depth information of the scene or prior models of the objects such as spatial models and smooth/predictable motion models are usually assumed before tracking. When these assumptions are unreasonable, the tracker may fail. To overcome this limitation, we propose a novel online sample based framework, inspired by the fact that the corresponding local parts of objects in sequential frames are always similar in the local color and texture features and spatial features relative to the centers of objects. Experimental results illustrate that the proposed approach works robustly under difficult and complex conditions.

Online Depth Image-Based Object Tracking with Sparse Representation and Object Detection

Online Depth Image-Based Object Tracking with Sparse Representation and Object Detection

Object tracking with 3D LIDAR via multi-task sparse learning

Detecting and Tracking Dynamic Objects in Complex Environments

Exploit Spatiotemporal Contextual Information for 3D Single Object Tracking Via Memory Networks

Visual Tracking via Sparse Representation and Online Dictionary Learning.

Online discriminative object tracking with local sparse representation

Visual Tracking Based on Online Sparse Feature Learning.

SparseTrack: Multi-Object Tracking by Performing Scene Decomposition based on Pseudo-Depth

Effective and Robust Object Tracking in Constrained Environments

Online Learning of Multi-Feature Weights for Robust Object Tracking

Tracking in Multimedia Data Via Robust Reweighted Local Multi-Task Sparse Representation for Transportation Surveillance

On Combining Compressed Sensing and Sparse Representations for Object Tracking.

Robust visual tracking based on online learning sparse representation

Object Tracking Algorithm under Occlusion Based on Sparse Representation

Real-Time Online Multi-Object Tracking

Online Multiple Object Tracking Via Exchanging Object Context.

Object Tracking by Occlusion Detection via Structured Sparse Learning

Occlusion-Aware Real-Time Object Tracking

Online Feature Extraction and Selection for Object Tracking

Tracking Multiple Objects Through Occlusion with Online Sampling and Position Estimation