P2FTrack: Multi-object Tracking with Motion Prior and Feature Posterior

Hong Zhang,Jiaxu Wan,Jing Zhang,Ding Yuan,XuLiang Li,Yifan Yang
DOI: https://doi.org/10.1145/3700443
2024-01-01
Abstract:Multiple object tracking (MOT) has emerged as a crucial component of the rapidly developing computer vision. However, existing multi-object tracking methods often overlook the relationship between features and motion, hindering the ability to strike a performance balance between coupled motion and complex scenes. In this work, we propose a novel end-to-end multi-object tracking method that integrates motion and feature information. To achieve this, we introduce a motion prior generator that transforms motion information into attention masks. Additionally, we leverage prior-posterior fusion multi-head attention to combine the motion-derived priors and attention-based posteriors. Our proposed method is extensively evaluated on MOT17 and DanceTrack datasets through comprehensive experiments and ablation studies, demonstrating state-of-the-art performance in the feature-based method with reasonable speed.
What problem does this paper attempt to address?