Abstract:Expanding the receptive field in a deep learning model for large-scale 3D point cloud segmentation is an effective technique for capturing rich contextual information, which consequently enhances the network's ability to learn meaningful features. However, this often leads to increased computational complexity and risk of overfitting, challenging the efficiency and effectiveness of the learning paradigm. To address these limitations, we propose the Local Split Attention Pooling (LSAP) mechanism to effectively expand the receptive field through a series of local split operations, thus facilitating the acquisition of broader contextual knowledge. Concurrently, it optimizes the computational workload associated with attention-pooling layers to ensure a more streamlined processing workflow. Based on LSAP, a Parallel Aggregation Enhancement (PAE) module is introduced to enable parallel processing of data using both 2D and 3D neighboring information to further enhance contextual representations within the network. In light of the aforementioned designs, we put forth a novel framework, designated as LSNet, for large-scale point cloud semantic segmentation. Extensive evaluations demonstrated the efficacy of seamlessly integrating the proposed PAE module into existing frameworks, yielding significant improvements in mean intersection over union (mIoU) metrics, with a notable increase of up to 11%. Furthermore, LSNet demonstrated superior performance compared to state-of-the-art semantic segmentation networks on three benchmark datasets, including S3DIS, Toronto3D, and SensatUrban. It is noteworthy that our method achieved a substantial speedup of approximately 38.8% compared to those employing similar-sized receptive fields, which serves to highlight both its computational efficiency and practical utility in real-world large-scale scenes.

Dilated Nearest-Neighbor Encoding for 3D Semantic Segmentation of Point Clouds

Semantic segmentation of large-scale point clouds based on dilated nearest neighbors graph

Multi-Scale Point-Wise Convolutional Neural Networks for 3D Object Segmentation From LiDAR Point Clouds in Large-Scale Environments

Point Attention Network for Semantic Segmentation of 3D Point Clouds

A Multi-scale Network for Semantic Segmentation of 3D Point Clouds

Efficiently Expanding Receptive Fields: Local Split Attention and Parallel Aggregation for Enhanced Large-scale Point Cloud Semantic Segmentation

Semantic Segmentation of Point Cloud Scene via Multi-Scale Feature Aggregation and Adaptive Fusion

NeiEA-NET: Semantic segmentation of large-scale point cloud scene via neighbor enhancement and aggregation

Fuzzy Neighborhood Learning for Deep 3-D Segmentation of Point Cloud.

S3Net: 3D LiDAR Sparse Semantic Segmentation Network

Cylindrical and Asymmetrical 3D Convolution Networks for LiDAR Segmentation

Cylindrical and Asymmetrical 3D Convolution Networks for LiDAR-based Perception

Weakly-Supervised Point Cloud Semantic Segmentation Based on Dilated Region

MS-RRFSegNet: Multiscale Regional Relation Feature Segmentation Network for Semantic Segmentation of Urban Scene Point Clouds.

Learning Semantic Segmentation of Large-Scale Point Clouds with Random Sampling.

FA-ResNet: Feature affine residual network for large-scale point cloud segmentation

Multi-Feature Aggregation for Semantic Segmentation of an Urban Scene Point Cloud

Point-based Attention Convolutional Neural Networks for Point Clouds Semantic Segmentation

DPC-Net: Distributed Point Convolution Network for large-scale point clouds semantic segmentation

Encoding Discriminative Representation for Point Cloud Semantic Segmentation

LLGF-Net: Learning Local and Global Feature Fusion for 3D Point Cloud Semantic Segmentation