Abstract: Expanding the receptive field of a deep learning model for large-scale 3D point cloud segmentation is an effective way to capture rich contextual information, which in turn enhances the network's ability to learn meaningful features. However, it often increases computational complexity and the risk of overfitting, which undermines both the efficiency and the effectiveness of learning. To address these limitations, we propose the Local Split Attention Pooling (LSAP) mechanism, which expands the receptive field through a series of local split operations and thereby captures broader contextual knowledge. At the same time, it optimizes the computational workload of the attention-pooling layers, streamlining processing. Building on LSAP, we introduce a Parallel Aggregation Enhancement (PAE) module that processes 2D and 3D neighboring information in parallel to further enhance contextual representations within the network. Combining these designs, we present LSNet, a novel framework for large-scale point cloud semantic segmentation. Extensive evaluations show that the proposed PAE module integrates seamlessly into existing frameworks and yields significant improvements in mean intersection over union (mIoU), with an increase of up to 11%. Furthermore, LSNet outperforms state-of-the-art semantic segmentation networks on three benchmark datasets: S3DIS, Toronto3D, and SensatUrban. Notably, our method achieves a speedup of approximately 38.8% over methods employing similar-sized receptive fields, which highlights both its computational efficiency and its practical utility in real-world large-scale scenes.
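For readers unfamiliar with the mIoU metric reported above, the following is a minimal sketch of how it is commonly computed from per-point labels. It is not the authors' evaluation code: the function name `mean_iou`, the toy labels, and the choice to skip classes absent from both prediction and ground truth are assumptions made for illustration, and benchmark protocols may accumulate counts over a whole test set instead.

```python
from typing import Sequence


def mean_iou(pred: Sequence[int], target: Sequence[int], num_classes: int) -> float:
    """Mean intersection over union, averaged over classes that occur
    in the prediction or the ground truth (illustrative convention)."""
    if len(pred) != len(target):
        raise ValueError("pred and target must have the same length")

    intersection = [0] * num_classes  # points where pred == target == c
    pred_count = [0] * num_classes    # points predicted as class c
    target_count = [0] * num_classes  # points labeled as class c

    for p, t in zip(pred, target):
        pred_count[p] += 1
        target_count[t] += 1
        if p == t:
            intersection[p] += 1

    ious = []
    for c in range(num_classes):
        union = pred_count[c] + target_count[c] - intersection[c]
        if union > 0:  # skip classes absent from both pred and target
            ious.append(intersection[c] / union)

    return sum(ious) / len(ious) if ious else 0.0


# Hypothetical per-point labels for six points and three classes.
pred = [0, 0, 1, 1, 2, 2]
target = [0, 1, 1, 1, 2, 0]
print(round(mean_iou(pred, target, 3), 4))  # prints 0.5
```

In this toy case the per-class IoU values are 1/3, 2/3, and 1/2, so their mean is 0.5.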

An Adaptive Post-Processing Network with the Global-Local Aggregation for Semantic Segmentation

APPFNet: Adaptive point-pixel fusion network for 3D semantic segmentation with neighbor feature aggregation

Attention Guided Global Enhancement and Local Refinement Network for Semantic Segmentation

Global Context Dependencies Aware Network for Efficient Semantic Segmentation of Fine-Resolution Remoted Sensing Images

Patch Proposal Network for Fast Semantic Segmentation of High-Resolution Images

APP-Net: Auxiliary-Point-Based Push and Pull Operations for Efficient Point Cloud Recognition

APNet: Attention Mechanism with Point Sampling Loss Network for Remote Sensing Images Semantic Segmentation

Global Aggregation then Local Distribution for Scene Parsing

Real-Time Semantic Segmentation via Spatial-Detail Guided Context Propagation

Gated Path Selection Network for Semantic Segmentation

Hybrid Dilated Convolution Network Using Attentive Kernels for Real-Time Semantic Segmentation

Efficiently Expanding Receptive Fields: Local Split Attention and Parallel Aggregation for Enhanced Large-scale Point Cloud Semantic Segmentation

Lightweight and Progressively-Scalable Networks for Semantic Segmentation

NeiEA-NET: Semantic segmentation of large-scale point cloud scene via neighbor enhancement and aggregation

SPG-Net: Segmentation Prediction and Guidance Network for Image Inpainting

ELKPPNet: An Edge-aware Neural Network with Large Kernel Pyramid Pooling for Learning Discriminative Features in Semantic Segmentation

Edge-Enhanced GCIFFNet: A Multiclass Semantic Segmentation Network Based on Edge Enhancement and Multiscale Attention Mechanism

Exploiting Local and Global Structure for Point Cloud Semantic Segmentation with Contextual Point Representations

HCNet: Hierarchical Context Network for Semantic Segmentation

Consistency-Regularized Region-Growing Network for Semantic Segmentation of Urban Scenes With Point-Level Annotations

HSNet: an Intelligent Hierarchical Semantic-Aware Network System for Real-Time Semantic Segmentation