Local-to-Global Perception Network for Point Cloud Segmentation

Haoxuan Wang,Ping Wei,Shuaijia Chen,Zhimin Liao,Jialu Qin
DOI: https://doi.org/10.1109/icme57554.2024.10687969
2024-01-01
Abstract:LiDAR-based point cloud segmentation is a significant and challenging task for 3D scene understanding. Recent voxel-based methods are often built on submanifold sparse residual calculation with small kernel size, which limits the local feature interactions and neglects the global contextual information. In this paper, we propose a local-to-global perception LiDAR-based point cloud segmentation network LGPSeg. From the local perspective, we design the Dynamic Spatial Aggregation Convolution to expand the receptive field range while avoiding a large increase in the model parameters. From the global perspective, we propose the BEV-Voxel Fusion to aggregate the global contextual information on the BEV feature maps through advanced 2D operators. By combining the local and global features, the 3D object and background information can be better captured. Our method achieves state-of-the-art results on two large datasets, SemanticKITTI and nuScenes, and even outperformed multimodal-based methods.
What problem does this paper attempt to address?