You Only Need Two Detectors to Achieve Multi-Modal 3D Multi-Object
Tracking
Xiyang Wang,Chunyun Fu,Jiawei He,Mingguang Huang,Ting Meng,Siyu Zhang,Hangning Zhou,Ziyao Xu,Chi Zhang
2023-01-01
Abstract:In the classical tracking-by-detection (TBD) paradigm, detection and tracking
are separately and sequentially conducted, and data association must be
properly performed to achieve satisfactory tracking performance. In this paper,
a new end-to-end multi-object tracking framework is proposed, which integrates
object detection and multi-object tracking into a single model. The proposed
tracking framework eliminates the complex data association process in the
classical TBD paradigm, and requires no additional training. Secondly, the
regression confidence of historical trajectories is investigated, and the
possible states of a trajectory (weak object or strong object) in the current
frame are predicted. Then, a confidence fusion module is designed to guide
non-maximum suppression for trajectories and detections to achieve ordered and
robust tracking. Thirdly, by integrating historical trajectory features, the
regression performance of the detector is enhanced, which better reflects the
occlusion and disappearance patterns of objects in real world. Lastly,
extensive experiments are conducted on the commonly used KITTI and Waymo
datasets. The results show that the proposed framework can achieve robust
tracking by using only a 2D detector and a 3D detector, and it is proven more
accurate than many of the state-of-the-art TBD-based multi-modal tracking
methods. The source codes of the proposed method are available at
https://github.com/wangxiyang2022/YONTD-MOT.