Abstract:We consider the problem of cross-sensor domain adaptation in the context of LiDAR-based 3D object detection and propose Stationary Object Aggregation Pseudo-labelling (SOAP) to generate high quality pseudo-labels for stationary objects. In contrast to the current state-of-the-art in-domain practice of aggregating just a few input scans, SOAP aggregates entire sequences of point clouds at the input level to reduce the sensor domain gap. Then, by means of what we call quasi-stationary training and spatial consistency post-processing, the SOAP model generates accurate pseudo-labels for stationary objects, closing a minimum of 30.3% domain gap compared to few-frame detectors. Our results also show that state-of-the-art domain adaptation approaches can achieve even greater performance in combination with SOAP, in both the unsupervised and semi-supervised settings.

What problem does this paper attempt to address?

This paper mainly explores the problem of cross-sensor domain adaptation in 3D object detection. The authors propose a method called SOAP (Stationary Object Aggregation Pseudo-labelling) to reduce the domain gap between different sensors. SOAP reduces the scan pattern differences by aggregating the point cloud of the entire sequence, thereby improving the quality of pseudo-labels for static objects. Compared to the current practice of only aggregating a few input frames, SOAP can more effectively handle cross-sensor data. In autonomous driving and other safety-critical robot applications, LiDAR sensors are used to provide precise 3D localization of objects. However, detectors trained on datasets in one domain experience significant performance degradation when dealing with data from another domain. SOAP leverages sequence-level full-sequence aggregation to generate accurate pseudo-labels for static objects, thereby improving the detection performance in cross-sensor scenarios. It also combines with other domain adaptation methods to further improve performance in both unsupervised and semi-supervised settings. Experiments show that SOAP can close at least 30.3% of the domain gap, and achieves better results when combined with existing state-of-the-art domain adaptation methods. The paper also mentions that although full-sequence aggregation may not be optimal in an intra-domain setting, it can improve performance in cross-sensor scenarios. SOAP is applicable in situations without target domain labels or with a small number of labels, and can enhance the performance of existing detectors. Overall, the paper addresses how to improve the adaptability of 3D object detectors between different sensor data by effectively utilizing full-sequence aggregation, improving the accuracy of static object detection, and demonstrating its synergy with other methods.

SOAP: Cross-sensor Domain Adaptation for 3D Object Detection Using Stationary Object Aggregation Pseudo-labelling

Source data‐free domain adaptation of object detector through domain‐specific perturbation

PLS: Unsupervised Domain Adaptation for 3d Object Detection Via Pseudo-Label Sizes

Pseudo Label Refinery for Unsupervised Domain Adaptation on Cross-dataset 3D Object Detection

SSC3OD: Sparsely Supervised Collaborative 3D Object Detection from LiDAR Point Clouds

STAL3D: Unsupervised Domain Adaptation for 3D Object Detection via Collaborating Self-Training and Adversarial Learning

Semi-Supervised Domain Adaptation Using Target-Oriented Domain Augmentation for 3D Object Detection

Revisiting Domain-Adaptive 3D Object Detection by Reliable, Diverse and Class-balanced Pseudo-Labeling

SPG: Unsupervised Domain Adaptation for 3D Object Detection via Semantic Point Generation

Complete & Label: A Domain Adaptation Approach to Semantic Segmentation of LiDAR Point Clouds

Monocular 3D Object Detection via Feature Domain Adaptation

SAR-CDSS: A Semi-Supervised Cross-Domain Object Detection from Optical to SAR Domain

Adaptation Via Proxy: Building Instance-Aware Proxy for Unsupervised Domain Adaptive 3d Object Detection

Semi-Supervised 3d Object Detection Via Adaptive Pseudo-Labeling

GPA-3D: Geometry-aware Prototype Alignment for Unsupervised Domain Adaptive 3D Object Detection from Point Clouds

MS3D: Leveraging Multiple Detectors for Unsupervised Domain Adaptation in 3D Object Detection

S4OD: Semi-Supervised learning for Single-Stage Object Detection

ST3D++: Denoised Self-Training for Unsupervised Domain Adaptation on 3D Object Detection

Density-Insensitive Unsupervised Domain Adaption on 3D Object Detection

ALPI: Auto-Labeller with Proxy Injection for 3D Object Detection using 2D Labels Only

Spatial Alignment for Unsupervised Domain Adaptive Single-Stage Object Detection