Leveraging Self-Supervised Instance Contrastive Learning for Radar Object Detection

Colin Decourt, Rufin VanRullen, Didier Salle, Thomas Oberlin
2024-02-13
Abstract: In recent years, driven by the need for safer and more autonomous transport systems, the automotive industry has shifted toward integrating a growing number of Advanced Driver Assistance Systems (ADAS). Among the array of sensors employed for object recognition tasks, radar sensors have emerged as a formidable contender due to their abilities in adverse weather conditions or low-light scenarios and their robustness in maintaining consistent performance across diverse environments. However, the small size of radar datasets and the complexity of labelling those data limit the performance of radar object detectors. Driven by the promising results of self-supervised learning in computer vision, this paper presents RiCL, an instance contrastive learning framework to pre-train radar object detectors. We propose to exploit the detections from the radar and the temporal information to pre-train the radar object detection model in a self-supervised way using contrastive learning. We aim to pre-train an object detector's backbone, head and neck so that it can learn from less data. Experiments on the CARRADA and the RADDet datasets show the effectiveness of our approach in learning generic representations of objects in range-Doppler maps. Notably, our pre-training strategy allows us to use only 20% of the labelled data to reach an mAP@0.5 similar to that of a supervised approach using the whole training set.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The paper addresses the cost of data annotation in radar object detection and proposes a self-supervised instance contrastive learning framework (RiCL) to reduce the dependency on large-scale annotated datasets. Its main objectives are:

1. **Reducing annotation workload**: Object proposals are created from the target list generated by the radar sensor processing chain, so pre-training can be conducted without extensive manual annotation.
2. **Enhancing detection performance**: By leveraging temporal information and radar detection outputs, the entire radar object detection model (backbone, neck, and head) is pre-trained in a self-supervised manner to improve the model's generalization ability across different tasks.
3. **Validating the method's effectiveness**: Experiments on the CARRADA and RADDet datasets demonstrate that, even with a small amount of annotated data, the method can reach performance similar to or better than fully supervised methods.

Comparative experiments show that the pre-trained model outperforms randomly initialized models when little labelled data is available and that the method generalizes well. The authors also discuss future research directions, such as comparison with other radar pre-training strategies.
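To make the idea of instance contrastive learning more concrete, here is a minimal sketch of an InfoNCE-style loss, a commonly used contrastive objective. It is an illustration under stated assumptions, not the authors' implementation: the summary above does not say which loss RiCL uses, and the function names (`info_nce_loss`, `cosine_similarity`), the temperature value, and the toy embeddings are all hypothetical. The sketch assumes that `anchors[i]` and `positives[i]` are embeddings of the same object instance seen in two different frames, and that the other instances in the batch serve as negatives.

```python
import math


def l2_normalize(v):
    """Scale a vector to unit length."""
    norm = math.sqrt(sum(x * x for x in v))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in v]


def cosine_similarity(a, b):
    """Cosine similarity between two vectors of equal length."""
    a, b = l2_normalize(a), l2_normalize(b)
    return sum(x * y for x, y in zip(a, b))


def info_nce_loss(anchors, positives, temperature=0.1):
    """InfoNCE-style loss over a batch of instance embeddings.

    Assumption: anchors[i] and positives[i] describe the same object in two
    frames; positives[j] with j != i are treated as negatives for anchor i.
    """
    if not anchors or len(anchors) != len(positives):
        raise ValueError("need two non-empty batches of equal size")
    total = 0.0
    for i, anchor in enumerate(anchors):
        logits = [cosine_similarity(anchor, p) / temperature for p in positives]
        # Log-sum-exp with the maximum subtracted for numerical stability.
        m = max(logits)
        log_denominator = m + math.log(sum(math.exp(l - m) for l in logits))
        # Negative log-probability of picking the true positive.
        total += log_denominator - logits[i]
    return total / len(anchors)


# Toy embeddings for two objects at frame t and frame t+1 (made-up values).
frame_t = [[1.0, 0.0], [0.0, 1.0]]
frame_t1 = [[0.9, 0.1], [0.1, 0.9]]

matched = info_nce_loss(frame_t, frame_t1)
mismatched = info_nce_loss(frame_t, frame_t1[::-1])
print(f"matched pairs: {matched:.4f}, mismatched pairs: {mismatched:.4f}")
```

The loss is low when each object's embedding stays close to its own embedding in the other frame and far from the other objects, which is the behaviour a contrastive pre-training stage tries to encourage before fine-tuning on the labelled subset.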