Tracklet Siamese Network with Constrained Clustering for Multiple Object Tracking

Jinlong Peng, Fan Qiu, John See, Qi Guo, Shaoshuai Huang, Ling-Yu Duan, Weiyao Lin
DOI: https://doi.org/10.1109/vcip.2018.8698623
2018-01-01
Abstract: Multiple object tracking (MOT) is an important yet challenging task in video understanding and analysis. MOT aims to associate detected objects into trajectories based on their temporal relationships. Occlusion among moving objects poses a major challenge to robust modeling of these relationships. In this paper, we propose a novel Tracklet Siamese Network (TSN) that learns similarities between tracklets characterized by appearance information, achieving superior performance on two MOTChallenge benchmark datasets. Our framework constructs short tracklets from highly related object detections by excluding inaccurate detections. We also adopt a constrained clustering technique to piece tracklets together into long trajectories, thus recovering many detections that were missed by the original detector or removed in the previous step. Comparisons against state-of-the-art methods are reported, and ablation studies further substantiate the viability of the components in our approach.
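The abstract does not spell out the clustering procedure, so the following is only a minimal sketch of how tracklets might be pieced into trajectories under a constraint. It rests on assumptions that are not taken from the paper: cosine similarity on hand-written feature vectors stands in for the learned TSN similarity, the only constraint is that two tracklets overlapping in time cannot belong to the same object, and merging is greedy average-linkage with a hypothetical `threshold`. The names `Tracklet`, `can_link`, and `constrained_clustering` are illustrative.

```python
from dataclasses import dataclass
from itertools import combinations
import math


@dataclass(frozen=True)
class Tracklet:
    tid: int
    start: int   # first frame, inclusive
    end: int     # last frame, inclusive
    feat: tuple  # appearance vector (assumed stand-in for a TSN embedding)


def cosine(a, b):
    # Cosine similarity between two feature vectors.
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b)))


def overlaps(a, b):
    # True if the two tracklets share at least one frame.
    return a.start <= b.end and b.start <= a.end


def can_link(c1, c2):
    # Assumed cannot-link constraint: one object cannot be in two places
    # in the same frame, so temporally overlapping tracklets never merge.
    return not any(overlaps(a, b) for a in c1 for b in c2)


def cluster_similarity(c1, c2):
    # Average linkage over all cross-cluster tracklet pairs.
    sims = [cosine(a.feat, b.feat) for a in c1 for b in c2]
    return sum(sims) / len(sims)


def constrained_clustering(tracklets, threshold=0.8):
    # Start with one cluster per tracklet and greedily merge the most
    # similar admissible pair until no pair reaches the threshold.
    clusters = [[t] for t in tracklets]
    while True:
        best = None
        for i, j in combinations(range(len(clusters)), 2):
            if not can_link(clusters[i], clusters[j]):
                continue
            s = cluster_similarity(clusters[i], clusters[j])
            if s >= threshold and (best is None or s > best[0]):
                best = (s, i, j)
        if best is None:
            break
        _, i, j = best  # i < j, so deleting j leaves index i valid
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]
    return [sorted(c, key=lambda t: t.start) for c in clusters]


# Toy data: tracklet 4 looks like tracklets 0 and 2 but overlaps both in time.
tracklets = [
    Tracklet(0, 0, 10, (1.0, 0.0)),
    Tracklet(1, 0, 12, (0.0, 1.0)),
    Tracklet(2, 15, 30, (0.9, 0.1)),
    Tracklet(3, 14, 25, (0.1, 0.9)),
    Tracklet(4, 5, 20, (1.0, 0.05)),
]
trajectories = constrained_clustering(tracklets)
trajectory_ids = [[t.tid for t in c] for c in trajectories]
print(trajectory_ids)
```

In this toy setup, tracklets 0 and 2 are joined across the gap between frames 10 and 15, which is the sense in which linking tracklets can recover missed detections. Tracklet 4 stays on its own despite its similar appearance, because the temporal constraint forbids merging it with either of them.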