SOAP: Cross-sensor Domain Adaptation for 3D Object Detection Using Stationary Object Aggregation Pseudo-labelling

Chengjie Huang,Vahdat Abdelzad,Sean Sedwards,Krzysztof Czarnecki
2024-01-09
Abstract:We consider the problem of cross-sensor domain adaptation in the context of LiDAR-based 3D object detection and propose Stationary Object Aggregation Pseudo-labelling (SOAP) to generate high quality pseudo-labels for stationary objects. In contrast to the current state-of-the-art in-domain practice of aggregating just a few input scans, SOAP aggregates entire sequences of point clouds at the input level to reduce the sensor domain gap. Then, by means of what we call quasi-stationary training and spatial consistency post-processing, the SOAP model generates accurate pseudo-labels for stationary objects, closing a minimum of 30.3% domain gap compared to few-frame detectors. Our results also show that state-of-the-art domain adaptation approaches can achieve even greater performance in combination with SOAP, in both the unsupervised and semi-supervised settings.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper mainly explores the problem of cross-sensor domain adaptation in 3D object detection. The authors propose a method called SOAP (Stationary Object Aggregation Pseudo-labelling) to reduce the domain gap between different sensors. SOAP reduces the scan pattern differences by aggregating the point cloud of the entire sequence, thereby improving the quality of pseudo-labels for static objects. Compared to the current practice of only aggregating a few input frames, SOAP can more effectively handle cross-sensor data. In autonomous driving and other safety-critical robot applications, LiDAR sensors are used to provide precise 3D localization of objects. However, detectors trained on datasets in one domain experience significant performance degradation when dealing with data from another domain. SOAP leverages sequence-level full-sequence aggregation to generate accurate pseudo-labels for static objects, thereby improving the detection performance in cross-sensor scenarios. It also combines with other domain adaptation methods to further improve performance in both unsupervised and semi-supervised settings. Experiments show that SOAP can close at least 30.3% of the domain gap, and achieves better results when combined with existing state-of-the-art domain adaptation methods. The paper also mentions that although full-sequence aggregation may not be optimal in an intra-domain setting, it can improve performance in cross-sensor scenarios. SOAP is applicable in situations without target domain labels or with a small number of labels, and can enhance the performance of existing detectors. Overall, the paper addresses how to improve the adaptability of 3D object detectors between different sensor data by effectively utilizing full-sequence aggregation, improving the accuracy of static object detection, and demonstrating its synergy with other methods.