Abstract:With the rapid advances in autonomous driving, it becomes critical to equip its sensing system with more holistic 3D perception. However, widely explored tasks like 3D detection or point cloud semantic segmentation focus on parsing either the objects (e.g. cars and pedestrians) or scenes (e.g. trees and buildings). In this work, we propose to address the challenging task of LiDAR-based Panoptic Segmentation, which aims to parse both objects and scenes in a unified manner. In particular, we propose Dynamic Shifting Network (DS-Net), which serves as an effective panoptic segmentation framework in the point cloud realm. DS-Net features a dynamic shifting module for complex LiDAR point cloud distributions. We observe that commonly used clustering algorithms like BFS or DBSCAN are incapable of handling complex autonomous driving scenes with non-uniform point cloud distributions and varying instance sizes. Thus, we present an efficient learnable clustering module, dynamic shifting, which adapts kernel functions on the fly for different instances. To further explore the temporal information, we extend the single-scan processing framework to its temporal version, namely 4D-DS-Net, for the task of 4D Panoptic Segmentation, where the same instance across multiple frames should be given the same ID prediction. Instead of naïvely appending a tracking module to DS-Net, we propose to solve the 4D panoptic segmentation in a more unified way. Specifically, 4D-DS-Net first constructs 4D data volume by aligning consecutive LiDAR scans, upon which the temporally unified instance clustering is performed to obtain the final results. Extensive experiments on two large-scale autonomous driving LiDAR datasets, SemanticKITTI and Panoptic nuScenes, are conducted to demonstrate the effectiveness and superior performance of the proposed solution. The code is publicly available at https://github.com/hongfz16/DS-Net.

Prototype-Voxel Contrastive Learning for LiDAR Point Cloud Panoptic Segmentation

CenterLPS: Segment Instances by Centers for LiDAR Panoptic Segmentation

Pass3d: Precise And Accelerated Semantic Segmentation For 3d Point Cloud

Panoptic-PolarNet: Proposal-free LiDAR Point Cloud Panoptic Segmentation

3D Open-Vocabulary Panoptic Segmentation with 2D-3D Vision-Language Distillation

LiDAR-based Panoptic Segmentation via Dynamic Shifting Network

LiDAR-based 4D Panoptic Segmentation via Dynamic Shifting Network

CPSeg: Cluster-free Panoptic Segmentation of 3D LiDAR Point Clouds

Directed Mix Contrast for Lidar Point Cloud Segmentation

A Technical Survey and Evaluation of Traditional Point Cloud Clustering Methods for LiDAR Panoptic Segmentation

Panoptic-PHNet: Towards Real-Time and High-Precision LiDAR Panoptic Segmentation via Clustering Pseudo Heatmap

PANet: LiDAR Panoptic Segmentation with Sparse Instance Proposal and Aggregation

LiDAR-Camera Panoptic Segmentation via Geometry-Consistent and Semantic-Aware Alignment

LiDAR Panoptic Segmentation for Autonomous Driving

EfficientLPS: Efficient LiDAR Panoptic Segmentation

PV-RCNN++: Semantical Point-Voxel Feature Interaction for 3D Object Detection

Unified 3D and 4D Panoptic Segmentation via Dynamic Shifting Networks

LiDAR-Based Real-Time Panoptic Segmentation via Spatiotemporal Sequential Data Fusion

Scan-based Semantic Segmentation of LiDAR Point Clouds: An Experimental Study

Differentiable Registration of Images and LiDAR Point Clouds with VoxelPoint-to-Pixel Matching

Subspace Prototype Guidance for Mitigating Class Imbalance in Point Cloud Semantic Segmentation