PointSSC: A Cooperative Vehicle-Infrastructure Point Cloud Benchmark for Semantic Scene Completion

Yuxiang Yan, Boda Liu, Jianfei Ai, Qinbu Li, Ru Wan, Jian Pu
2024-03-07
Abstract: Semantic Scene Completion (SSC) aims to jointly generate space occupancies and semantic labels for complex 3D scenes. Most existing SSC models focus on volumetric representations, which are memory-inefficient for large outdoor spaces. Point clouds provide a lightweight alternative, but existing benchmarks lack outdoor point cloud scenes with semantic labels. To address this, we introduce PointSSC, the first cooperative vehicle-infrastructure point cloud benchmark for semantic scene completion. These scenes exhibit long-range perception and minimal occlusion. We develop an automated annotation pipeline leveraging Semantic Segment Anything to efficiently assign semantics. To benchmark progress, we propose a LiDAR-based model with a Spatial-Aware Transformer for global and local feature extraction and a Completion and Segmentation Cooperative Module for joint completion and segmentation. PointSSC provides a challenging testbed to drive advances in semantic point cloud completion for real-world navigation. The code and datasets are available at https://github.com/yyxssm/PointSSC.
Computer Vision and Pattern Recognition, Artificial Intelligence, Machine Learning
What problem does this paper attempt to address?
This paper addresses the shortcomings of existing Semantic Scene Completion (SSC) models in large-scale outdoor scenes. Most existing SSC models rely on volumetric representations, which are memory-inefficient for large outdoor spaces, and existing benchmarks lack outdoor point cloud scenes with semantic labels. The paper therefore proposes PointSSC, the first cooperative vehicle-infrastructure point cloud benchmark for semantic scene completion, which targets the following issues:

1. **Dataset limitations**: Most existing SSC datasets rely on data collected by on-board sensors, which have a limited sensing range and are prone to occlusion. For example, SemanticKITTI only provides semantic scenes from the front-view perspective, while SurroundOcc and OpenOccupancy, although they incorporate surrounding views, still cannot handle occluded areas effectively. Occ3D uses ray casting to generate occlusion masks, but only to improve evaluation metrics, not to improve the quality of the ground-truth labels.
2. **Complexity of outdoor scenes**: Existing vehicle-view datasets cannot support the long-range perception needed in real driving environments and are subject to the occlusion that is ubiquitous there. A dataset that also captures the scene from an infrastructure perspective is therefore needed to provide richer and more complete semantic annotations.
3. **Model effectiveness**: To validate the PointSSC dataset, the paper proposes a LiDAR-based model that combines a Spatial-Aware Transformer with a Completion and Segmentation Cooperative Module (CSCM) to perform completion and segmentation jointly.

By addressing these issues, PointSSC aims to provide a challenging testbed for semantic point cloud completion research in outdoor autonomous navigation and to advance the field.
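
The memory argument for point clouds over dense volumes can be made concrete with a rough back-of-the-envelope sketch. All numbers below (scene extent, voxel size, point count, bytes per element) are hypothetical values chosen for illustration and are not taken from the PointSSC paper; the helper names `dense_voxel_bytes` and `point_cloud_bytes` are likewise made up here.

```python
def dense_voxel_bytes(extent_m, voxel_size_m, bytes_per_voxel=1):
    """Bytes needed for a dense label grid covering extent_m = (x, y, z) in metres.

    Assumes one label per voxel (1 byte by default), stored for every cell,
    occupied or not.
    """
    nx, ny, nz = (round(e / voxel_size_m) for e in extent_m)
    return nx * ny * nz * bytes_per_voxel


def point_cloud_bytes(num_points, bytes_per_point=13):
    """Bytes needed for a labeled point cloud.

    Assumes 3 float32 coordinates (12 bytes) plus 1 uint8 label per point.
    """
    return num_points * bytes_per_point


# Hypothetical outdoor scene: 256 m x 256 m x 32 m.
extent = (256.0, 256.0, 32.0)

coarse = dense_voxel_bytes(extent, 0.5)    # 512 * 512 * 64 voxels
fine = dense_voxel_bytes(extent, 0.25)     # halving the voxel size
points = point_cloud_bytes(100_000)        # hypothetical 100k labeled points

print(f"dense grid @ 0.50 m: {coarse / 1e6:.1f} MB")
print(f"dense grid @ 0.25 m: {fine / 1e6:.1f} MB")
print(f"labeled point cloud: {points / 1e6:.1f} MB")
```

Under these assumptions the dense grid cost grows cubically with resolution (halving the voxel size multiplies it by eight), while the point cloud cost depends only on the number of points kept. Sparse voxel structures narrow this gap in practice, so the sketch illustrates the trend rather than a measured comparison.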