Voxel-to-Pillar: Knowledge Distillation of 3D Object Detection in Point Cloud

Jinbao Zhang, Jun Liu
DOI: https://doi.org/10.1145/3651640.3651652
2024-01-01
Abstract: LiDAR point cloud object detection plays an important role in autonomous driving. However, the trade-off between detection accuracy and inference speed hinders further development. Voxel-based networks achieve high accuracy, but the 3D sparse convolution they use for feature extraction slows inference and complicates model deployment. Pillar-based networks perform well at real-time inference, but they are less accurate than voxel-based networks. In this paper, we propose voxel-to-pillar knowledge distillation (VOP KD) to transfer rich knowledge from voxel-based networks to pillar-based networks. Guided by high-confidence teacher predictions, we compute two distillation losses that help the student learn from the teacher without introducing additional cost during inference. We conduct experiments on nuScenes, and the results demonstrate that VOP KD effectively improves the mean average precision (mAP) and nuScenes detection score (NDS) of the student detector.
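The abstract does not say which two distillation losses are used or how the high-confidence teacher predictions enter them. As a minimal sketch of the general idea only, the following assumes a feature-imitation term and a response term, both restricted to grid cells where the teacher's score reaches a threshold. The function names, the threshold, the loss weights, and the use of one scalar feature per cell are hypothetical choices for illustration, not the paper's method.

```python
from typing import List, Sequence


def confidence_mask(teacher_scores: Sequence[float], threshold: float = 0.5) -> List[float]:
    """Return 1.0 for cells where the teacher score reaches the threshold, else 0.0."""
    return [1.0 if score >= threshold else 0.0 for score in teacher_scores]


def masked_mse(student: Sequence[float], teacher: Sequence[float], mask: Sequence[float]) -> float:
    """Mean squared error averaged over the masked (kept) cells only."""
    kept = sum(mask)
    if kept == 0:
        # No confident teacher cell: nothing to distill from in this sample.
        return 0.0
    total = sum(m * (s - t) ** 2 for s, t, m in zip(student, teacher, mask))
    return total / kept


def distillation_loss(
    student_feat: Sequence[float],
    teacher_feat: Sequence[float],
    student_scores: Sequence[float],
    teacher_scores: Sequence[float],
    threshold: float = 0.5,  # assumed confidence cutoff
    w_feat: float = 1.0,     # assumed weight of the feature term
    w_resp: float = 1.0,     # assumed weight of the response term
) -> float:
    """Weighted sum of two masked losses (an assumed pairing, see the lead-in)."""
    mask = confidence_mask(teacher_scores, threshold)
    feat_loss = masked_mse(student_feat, teacher_feat, mask)
    resp_loss = masked_mse(student_scores, teacher_scores, mask)
    return w_feat * feat_loss + w_resp * resp_loss


# Toy flattened bird's-eye-view grid with four cells.
teacher_scores = [0.9, 0.1, 0.8, 0.2]
student_scores = [0.7, 0.5, 0.8, 0.9]
teacher_feat = [1.0, 2.0, 3.0, 4.0]
student_feat = [0.0, 2.0, 3.0, 0.0]

loss = distillation_loss(student_feat, teacher_feat, student_scores, teacher_scores)
```

Both terms are computed only during training, so a scheme of this kind leaves the student's inference path unchanged, which matches the abstract's claim of no additional inference cost.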