TOPIC: A Parallel Association Paradigm for Multi-Object Tracking under Complex Motions and Diverse Scenes

Xiaoyan Cao,Yiyao Zheng,Yao Yao,Huapeng Qin,Xiaoyu Cao,Shihui Guo
2023-08-22
Abstract:Video data and algorithms have been driving advances in multi-object tracking (MOT). While existing MOT datasets focus on occlusion and appearance similarity, complex motion patterns are widespread yet overlooked. To address this issue, we introduce a new dataset called BEE23 to highlight complex motions. Identity association algorithms have long been the focus of MOT research. Existing trackers can be categorized into two association paradigms: single-feature paradigm (based on either motion or appearance feature) and serial paradigm (one feature serves as secondary while the other is primary). However, these paradigms are incapable of fully utilizing different features. In this paper, we propose a parallel paradigm and present the Two rOund Parallel matchIng meChanism (TOPIC) to implement it. The TOPIC leverages both motion and appearance features and can adaptively select the preferable one as the assignment metric based on motion level. Moreover, we provide an Attention-based Appearance Reconstruct Module (AARM) to reconstruct appearance feature embeddings, thus enhancing the representation of appearance features. Comprehensive experiments show that our approach achieves state-of-the-art performance on four public datasets and BEE23. Notably, our proposed parallel paradigm surpasses the performance of existing association paradigms by a large margin, e.g., reducing false negatives by 12% to 51% compared to the single-feature association paradigm. The introduced dataset and association paradigm in this work offers a fresh perspective for advancing the MOT field. The source code and dataset are available at <a class="link-external link-https" href="https://github.com/holmescao/TOPICTrack" rel="external noopener nofollow">this https URL</a>.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper attempts to solve the problems of complex motion patterns and diverse scenarios in multi - object tracking (MOT). Specifically: 1. **Dataset problem**: Existing MOT datasets mainly focus on occlusion and appearance similarity, while ignoring the widely - existing complex motion patterns. For this reason, the author constructs a new dataset BEE23, which emphasizes complex motion patterns and aims to provide a more challenging benchmark to promote the research of general MOT algorithms. 2. **Algorithm optimization problem**: Most of the existing trackers adopt a single - feature association paradigm or a serial association paradigm. These paradigms cannot fully utilize different features, resulting in poor performance when dealing with complex scenarios. The author proposes a parallel association paradigm and designs the Two rOund Parallel matchIng meChanism (TOPIC) to implement this paradigm. TOPIC uses both motion and appearance features as association metrics and adaptively selects more appropriate features according to the motion level, thereby reducing the false positive rate. 3. **Feature representation problem**: In order to enhance the representation ability of appearance features, the author proposes the Attention - based Appearance Reconstruct Module (AARM), which reconstructs the appearance feature embedding through the attention mechanism, improving the discrimination ability between different objects and the similarity of the same object across frames. In summary, the main contributions of this paper are as follows: - Provide a new dataset BEE23, which emphasizes complex motion patterns and enriches the data resources for MOT research. - Propose a new parallel association paradigm and its implementation mechanism TOPIC, which improves the tracking performance in complex motion and diverse scenarios. - Introduce the AARM module, which enhances the representation ability of appearance features and further improves the tracking effect.