JDAN: Joint Detection and Association Network for Real-Time Online Multi-Object Tracking

Haidong Wang, Xuan He, Zhiyong Li, Jin Yuan, Shutao Li
DOI: https://doi.org/10.1145/3533253
IF: 4.094
2023-01-01
ACM Transactions on Multimedia Computing, Communications, and Applications
Abstract: In the last few years, enormous strides have been made in object detection and data association, which are vital subtasks for one-stage online multi-object tracking (MOT). However, the two submodules in the MOT pipeline are usually processed or optimized separately, resulting in a complex method design that requires manual settings. In addition, few works integrate the two subtasks into a single end-to-end network to optimize the overall task. In this study, we propose an end-to-end MOT network called the joint detection and association network (JDAN), in which both training and inference are performed within a single network. All layers in JDAN are differentiable and can be optimized jointly to detect targets and output an association matrix for robust multi-object tracking. Moreover, we generate suitable pseudo-labels to address the data inconsistency between object detection and association. The detection and association submodules are optimized by a composite loss function derived from the detection results and the generated pseudo association labels, respectively. The proposed approach is evaluated on two MOT challenge datasets and achieves promising performance compared with classic and recent methods.