Abstract:Semantic segmentation based on LiDAR plays an important role in the environment perception, path planning, and decision control of unmanned ground systems. However, in challenging environments with complex background interleaving, the high proportion of drivable areas and significant target elements pose serious challenges to segmentation tasks. Existing studies attempt to address these challenges but still grapple with issues such as interclass semantic conflicts and inaccurate edge segmentation. To address these challenges effectively, a reliable point cloud segmentation network driven by multiview fusion and multiedge guidance is specially customized. Specifically, the network's point cloud multiview range image view (RIV) and bird's eye view (BEV) encoding backbones use lightweight pyramid architectures to extract specific details and semantic features of multiscale point clouds. Subsequently, a dedicated multiview edge guidance module focuses on enhancing interclass edge differentiation features from RIV and BEV, respectively. These features are fused with the encoding backbone hierarchy, and the extraction of edge information is supervised by a new hybrid edge loss to maximize the consistency of semantic segmentation and predicted edges. Furthermore, a multiview and multiscale feature fusion module based on a multihead attention mechanism is introduced to enhance the potential complementarity and interaction of the captured features. Through comprehensive experiments and ablation studies on the SemanticKITTI and RELLIS-3D datasets, recognized in the field of autonomous driving, the results demonstrate that our custom method is competitive in terms of intersection over union (IoU) accuracy and real-time indicators compared with typical baseline methods.

TPV-IGKD: Image-Guided Knowledge Distillation for 3D Semantic Segmentation with Tri-Plane-View

Pass3d: Precise And Accelerated Semantic Segmentation For 3d Point Cloud

Knowledge Distillation from 3D to Bird's-Eye-View for LiDAR Semantic Segmentation

Multi-to-Single Knowledge Distillation for Point Cloud Semantic Segmentation

Accurate 3-D Semantic Segmentation of Point Clouds for Intelligent Vehicles Based on Multiview Edge Guidance and Fusion

Accurate 3D Semantic Segmentation of Point Clouds for Intelligent Vehicles Based on Multi-view Edge Guidance and Fusion

Triple-View Knowledge Distillation for Semi-Supervised Semantic Segmentation

A Multi-View-Assisted Semantic Segmentation Network on LiDAR Via Multi-Level Mutual Learning Knowledge Distillation

Uplifting Range-View-based 3D Semantic Segmentation in Real-Time with Multi-Sensor Fusion

3D Open-Vocabulary Panoptic Segmentation with 2D-3D Vision-Language Distillation

Robust 3D Semantic Segmentation Method Based on Multi-Modal Collaborative Learning

Uni-to-Multi Modal Knowledge Distillation for Bidirectional LiDAR-Camera Semantic Segmentation

CMDFusion: Bidirectional Fusion Network with Cross-modality Knowledge Distillation for LIDAR Semantic Segmentation

HVDistill: Transferring Knowledge from Images to Point Clouds via Unsupervised Hybrid-View Distillation

Revisiting Multi-modal 3D Semantic Segmentation in Real-world Autonomous Driving

Multiview Fusion Driven 3-D Point Cloud Semantic Segmentation Based on Hierarchical Transformer

LiDAR-Based Real-Time Panoptic Segmentation via Spatiotemporal Sequential Data Fusion

Robust 3D Semantic Segmentation Based on Multi-Phase Multi-Modal Fusion for Intelligent Vehicles

2DPASS: 2D Priors Assisted Semantic Segmentation on LiDAR Point Clouds