SCMOT: Improving 3D Multi-Object Tracking Via Semantic Inference and Confidence Optimization

Lin Zhao,Meiling Wang,Yufeng Yue
DOI: https://doi.org/10.1109/ccdc62350.2024.10587787
2024-01-01
Abstract:3D multi-object tracking (MOT) is a fundamental technology in autonomous systems, playing a pivotal role across applications like autonomous driving and intelligent transportation systems. Previous 3D MOT methods mainly rely on LiDAR point clouds for object detection and tracking, often facing challenges such as occlusions and sparse data. This paper introduces SCMOT, a novel multi-modal 3D MOT framework designed to address the limitations of existing LiDAR-based 3D MOT methods. SCMOT enhances 3D object detection by filtering and refining results using semantic information, thereby reducing erroneous or redundant detections. To improve data association and enhance tracking precision, a multi-modal cost function that combines prediction confidence, semantic cues, and distance information is presented. Moreover, SCMOT can be served as a plug-and-play solution, integrating with diverse point cloud-based 3D object detectors. Extensive experiments on the KITTI tracking dataset validate the feasibility and effectiveness of SCMOT in real-world autonomous driving scenarios.
What problem does this paper attempt to address?