A Preprocessing and Postprocessing Voxel-based Method for LiDAR Semantic Segmentation Improvement in Long Distance

Andrea Matteazzi,Pascal Colling,Michael Arnold,Dietmar Tutsch
2024-05-16
Abstract:In recent years considerable research in LiDAR semantic segmentation was conducted, introducing several new state of the art models. However, most research focuses on single-scan point clouds, limiting performance especially in long distance outdoor scenarios, by omitting time-sequential information. Moreover, varying-density and occlusions constitute significant challenges in single-scan approaches. In this paper we propose a LiDAR point cloud preprocessing and postprocessing method. This multi-stage approach, in conjunction with state of the art models in a multi-scan setting, aims to solve those challenges. We demonstrate the benefits of our method through quantitative evaluation with the given models in single-scan settings. In particular, we achieve significant improvements in mIoU performance of over 5 percentage point in medium range and over 10 percentage point in far range. This is essential for 3D semantic scene understanding in long distance as well as for applications where offline processing is permissible.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The paper aims to address several key issues in long-range LiDAR semantic segmentation. Specifically: 1. **Limitations of single-scan point clouds**: Most existing studies focus on single-scan point clouds, ignoring temporal sequence information, which limits performance in long-range outdoor scenes. 2. **Varying density and occlusion problems**: Single-scan methods struggle to handle varying-density and occlusions in point cloud data, especially in distant areas. 3. **Improving long-range segmentation accuracy**: Existing methods mainly focus on improving segmentation performance of near-range point clouds, while the segmentation performance of long-range point clouds is relatively low. To address these issues, the authors propose a voxel-based LiDAR point cloud preprocessing and postprocessing method. This method combines multi-scan point cloud information, maintaining a fixed structure while increasing the point cloud density in mid-to-long-range areas, thereby significantly improving the semantic segmentation performance in these regions. Specifically, the method improves the mean Intersection over Union (mIoU) by more than 5 percentage points in the mid-range and by more than 10 percentage points in the long-range. This is crucial for applications requiring high-precision long-range 3D scene understanding. Additionally, the method has demonstrated its effectiveness in offline processing scenarios.