Abstract:Three-dimensional point cloud data generally contain complex scene information and diversified category structures. Existing point cloud semantic segmentation networks tend to learn feature information between sampled center points and their neighboring points, while ignoring the scale and structural information of the spatial context of the sampled center points. To address these issues, this paper introduces PointNAC (PointNet based on normal vector and attention copula feature enhancement), a network designed for point cloud semantic segmentation in large-scale complex scenes, which consists of the following two main modules: (1) The local stereoscopic feature-encoding module: this feature-encoding process incorporates distance, normal vectors, and angles calculated based on the cosine theorem, enabling the network to learn not only the spatial positional information of the point cloud but also the spatial scale and geometric structure; and (2) the copula-based similarity feature enhancement module. Based on the stereoscopic feature information, this module analyzes the correlation among points in the local neighborhood. It enhances the features of positively correlated points while leaving the features of negatively correlated points unchanged. By combining these enhancements, it effectively enhances the feature saliency within the same class and the feature distinctiveness between different classes. The experimental results show that PointNAC achieved an overall accuracy (OA) of 90.9% and a mean intersection over union (MIoU) of 67.4% on the S3DIS dataset. And on the Vaihingen dataset, PointNAC achieved an overall accuracy (OA) of 85.9% and an average F1 score of 70.6%. Compared to the segmentation results of other network models on public datasets, our algorithm demonstrates good generalization and segmentation capabilities.

Learnable scene prior for point cloud semantic segmentation

GeoSegNet: Point Cloud Semantic Segmentation via Geometric Encoder-Decoder Modeling

Semantic Context Encoding for Accurate 3D Point Cloud Segmentation

SCF-Net: Learning Spatial Contextual Features for Large-Scale Point Cloud Segmentation.

Semantic segmentation of large-scale point clouds based on dilated nearest neighbors graph

NeiEA-NET: Semantic segmentation of large-scale point cloud scene via neighbor enhancement and aggregation

Weakly Supervised Point Cloud Segmentation Via Deep Morphological Semantic Information Embedding

Compensating for Local Ambiguity With Encoder-Decoder in Urban Scene Segmentation

Fast Context-Awareness Encoder for LiDAR Point Semantic Segmentation

Semantic Segmentation of Point Cloud Scene via Multi-Scale Feature Aggregation and Adaptive Fusion

3D Scene Graph Generation from Point Clouds

Large-scale point cloud semantic segmentation via local perception and global descriptor vector

Dilated Nearest-Neighbor Encoding for 3D Semantic Segmentation of Point Clouds

Exploring Deep 3D Spatial Encodings for Large-Scale 3D Scene Understanding

A Two-Pipeline Instance Segmentation Network via Boundary Enhancement for Scene Understanding

Semantic Segmentation-assisted Scene Completion for LiDAR Point Clouds

Semantic Gaussians: Open-Vocabulary Scene Understanding with 3D Gaussian Splatting

Semantic Segmentation of Aerial Laser Point Clouds Based on Deep-Residual Enhanced Coding of Multi-Feature Information

PointNAC: Copula-Based Point Cloud Semantic Segmentation Network

LLGF-Net: Learning Local and Global Feature Fusion for 3D Point Cloud Semantic Segmentation

Learning Segmented 3D Gaussians via Efficient Feature Unprojection for Zero-shot Neural Scene Segmentation