Abstract: LiDAR-based semantic segmentation plays an important role in the environment perception, path planning, and decision control of unmanned ground systems. However, in challenging environments with complex, interleaved backgrounds, the high proportion of drivable areas and significant target elements pose serious challenges to segmentation. Existing studies attempt to address these challenges but still struggle with interclass semantic conflicts and inaccurate edge segmentation. To address these issues, we propose a reliable point cloud segmentation network driven by multiview fusion and multiedge guidance. Specifically, the network's range image view (RIV) and bird's eye view (BEV) encoding backbones use lightweight pyramid architectures to extract detail and semantic features from multiscale point clouds. A dedicated multiview edge guidance module then enhances interclass edge differentiation features from RIV and BEV separately. These features are fused with the encoding backbone hierarchy, and edge extraction is supervised by a new hybrid edge loss to maximize the consistency between the semantic segmentation and the predicted edges. Furthermore, a multiview, multiscale feature fusion module based on a multihead attention mechanism is introduced to enhance the complementarity and interaction of the captured features. Comprehensive experiments and ablation studies on the SemanticKITTI and RELLIS-3D datasets, both recognized in the field of autonomous driving, show that our method is competitive with typical baseline methods in intersection over union (IoU) accuracy and real-time performance.
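For readers unfamiliar with the IoU metric named in the abstract, the following is a minimal sketch of per-class IoU and its mean over classes for point-wise labels. It is not the authors' evaluation code: the function names `per_class_iou` and `mean_iou`, the `ignore_label` parameter, and the choice to skip classes that never appear are assumptions made for illustration.

```python
from collections import Counter


def per_class_iou(pred, target, num_classes, ignore_label=None):
    """Per-class IoU = TP / (TP + FP + FN) over point-wise labels.

    Hypothetical helper: returns None for a class that appears in
    neither the predictions nor the ground truth.
    """
    tp, fp, fn = Counter(), Counter(), Counter()
    for p, t in zip(pred, target):
        if t == ignore_label:
            continue  # assumed convention: unlabeled points are skipped
        if p == t:
            tp[t] += 1
        else:
            fp[p] += 1  # predicted class p where it does not belong
            fn[t] += 1  # missed a point of class t
    ious = []
    for c in range(num_classes):
        denom = tp[c] + fp[c] + fn[c]
        ious.append(tp[c] / denom if denom else None)
    return ious


def mean_iou(ious):
    """Average IoU over the classes that actually occur."""
    valid = [v for v in ious if v is not None]
    return sum(valid) / len(valid) if valid else 0.0


# Toy example: five points, three occurring classes, one unused class.
pred = [0, 0, 1, 1, 2]
target = [0, 1, 1, 1, 2]
ious = per_class_iou(pred, target, num_classes=4)
miou = mean_iou(ious)
```

In the toy example, class 0 has one true positive and one false positive (IoU 0.5), class 1 has two true positives and one false negative (IoU 2/3), and class 2 is predicted perfectly (IoU 1.0). Benchmarks typically fix their own class list and ignore label, so those conventions should be taken from the dataset's evaluation protocol rather than from this sketch.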

SEG-VoxelNet for 3D Vehicle Detection from RGB and LiDAR Data

SegVoxelNet: Exploring Semantic Context and Depth-aware Features for 3D Vehicle Detection from Point Cloud

Robust vehicle detection using 3D Lidar under complex urban environment

3D Vehicle Detection Using Cheap LiDAR and Camera Sensors

Accurate 3-D Semantic Segmentation of Point Clouds for Intelligent Vehicles Based on Multiview Edge Guidance and Fusion

VIN: Voxel-based Implicit Network for Joint 3D Object Detection and Segmentation for Lidars

Semantic-aware 3D-voxel CenterNet for point cloud object detection

Improved 3D Semantic Segmentation Model Based on RGB Image and LiDAR Point Cloud Fusion for Automantic Driving

3D Object Detection for Point Cloud in Virtual Driving Environment

3D Detection for Occluded Vehicles From Point Clouds

3D Fully Convolutional Network for Vehicle Detection in Point Cloud

Stereo RGB and Deeper LIDAR-Based Network for 3D Object Detection in Autonomous Driving

Region-proposal Convolutional Network-driven Point Cloud Voxelization and Over-segmentation for 3D Object Detection

Image Guidance Based 3D Vehicle Detection in Traffic Scene

PVI-Net: Point-Voxel-Image Fusion for Semantic Segmentation of Point Clouds in Large-Scale Autonomous Driving Scenarios

PA3DNet: 3-D Vehicle Detection with Pseudo Shape Segmentation and Adaptive Camera-LiDAR Fusion

VS-Net: A Voxel Encoding and Sparse Convolution Embedded Network for LiDAR 3D Object Detection

R-VPCG: RGB image feature fusion-based virtual point cloud generation for 3D car detection

RGB and LiDAR Fusion-based 3D Semantic Segmentation for Autonomous Driving

Stereo RGB and Deeper LIDAR Based Network for 3D Object Detection