MonoWAD: Weather-Adaptive Diffusion Model for Robust Monocular 3D Object Detection

Youngmin Oh, Hyung-Il Kim, Seong Tae Kim, Jung Uk Kim
2024-07-23
Abstract: Monocular 3D object detection is an important and challenging task in autonomous driving. Existing methods mainly focus on performing 3D detection in ideal weather conditions, characterized by scenarios with clear and optimal visibility. However, autonomous driving requires the ability to handle changes in weather conditions, such as foggy weather, not just clear weather. We introduce MonoWAD, a novel weather-robust monocular 3D object detector with a weather-adaptive diffusion model. It contains two components: (1) the weather codebook, which memorizes knowledge of clear weather and generates a weather-reference feature for any input, and (2) the weather-adaptive diffusion model, which enhances the representation of the input feature by incorporating the weather-reference feature. The weather-reference feature serves an attention role, indicating how much improvement the input feature needs according to the weather conditions. To achieve this goal, we introduce a weather-adaptive enhancement loss to enhance the feature representation under both clear and foggy weather conditions. Extensive experiments under various weather conditions demonstrate that MonoWAD achieves weather-robust monocular 3D object detection. The code and dataset are released at https://github.com/VisualAIKHU/MonoWAD.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper addresses the robustness of monocular 3D object detection under varying weather conditions in autonomous driving. Existing methods mainly target ideal conditions (clear weather), but in practice an autonomous vehicle must also cope with adverse weather such as fog, which significantly degrades detection performance.

The paper proposes MonoWAD (Weather-Adaptive Diffusion Model for Robust Monocular 3D Object Detection) to improve the robustness of monocular 3D object detection across weather conditions, especially fog, which strongly degrades visual information. To achieve this, the authors introduce two key components:

1. **Weather Codebook**: Memorizes knowledge of clear weather and generates a weather-reference feature for any input. The reference feature helps indicate how much improvement the input feature needs under the current weather conditions.
2. **Weather-Adaptive Diffusion Model**: Incorporates the weather-reference feature to dynamically enhance the representation of the input feature, adapting it to the weather conditions.

In addition, the authors introduce a Weather-Adaptive Enhancement Loss so that the feature representation is enhanced under both clear and foggy conditions.

In summary, the core problem of this paper is improving the robustness and accuracy of monocular 3D object detection under various weather conditions, especially challenging ones such as fog.
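The idea of generating a weather-reference feature from a codebook can be illustrated with a toy soft lookup. This is a minimal sketch under stated assumptions, not the authors' implementation: the function names (`weather_reference`, `enhancement_gap`), the distance-based softmax weighting, the `temperature` parameter, and the two-dimensional toy features are all hypothetical and chosen only for illustration. The text above does not specify how MonoWAD builds or queries its codebook.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def weather_reference(feature, codebook, temperature=1.0):
    """Blend codebook entries by their similarity to the input feature.

    Hypothetical soft lookup: an entry with a smaller squared Euclidean
    distance to the input receives a larger weight.
    """
    scores = [
        -sum((f - c) ** 2 for f, c in zip(feature, entry)) / temperature
        for entry in codebook
    ]
    weights = softmax(scores)
    return [
        sum(w * entry[i] for w, entry in zip(weights, codebook))
        for i in range(len(feature))
    ]


def enhancement_gap(feature, reference):
    """Mean absolute difference between a feature and its reference.

    Illustrative proxy for "how much improvement is needed".
    """
    return sum(abs(f - r) for f, r in zip(feature, reference)) / len(feature)


# Toy codebook standing in for clear-weather knowledge (made-up values).
codebook = [[1.0, 0.0], [0.0, 1.0]]
clear_like = [0.9, 0.1]  # already close to a codebook entry
foggy_like = [0.4, 0.2]  # attenuated, lower-contrast version

gap_clear = enhancement_gap(clear_like, weather_reference(clear_like, codebook))
gap_foggy = enhancement_gap(foggy_like, weather_reference(foggy_like, codebook))
print(gap_clear < gap_foggy)
```

In this toy setup the degraded feature sits farther from its reference than the clear-like one, which mirrors the role the summary assigns to the weather-reference feature: signalling how much enhancement an input needs. In the paper, that enhancement is performed by a diffusion model rather than by a simple distance measure.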