Abstract:Drone-based object detection in adverse weather conditions is crucial for enhancing drones' environmental perception, yet it remains largely unexplored due to the lack of relevant benchmarks. To bridge this gap, we introduce HazyDet, a large-scale dataset tailored for drone-based object detection in hazy scenes. It encompasses 383,000 real-world instances, collected from both naturally hazy environments and normal scenes with synthetically imposed haze effects to simulate adverse weather conditions. By observing the significant variations in object scale and clarity under different depth and haze conditions, we designed a Depth Conditioned Detector (DeCoDet) to incorporate this prior knowledge. DeCoDet features a Multi-scale Depth-aware Detection Head that seamlessly integrates depth perception, with the resulting depth cues harnessed by a dynamic Depth Condition Kernel module. Furthermore, we propose a Scale Invariant Refurbishment Loss to facilitate the learning of robust depth cues from pseudo-labels. Extensive evaluations on the HazyDet dataset demonstrate the flexibility and effectiveness of our method, yielding significant performance improvements. Our dataset and toolkit are available at <a class="link-external link-https" href="https://github.com/GrokCV/HazyDet" rel="external noopener nofollow">this https URL</a>.

What problem does this paper attempt to address?

This paper attempts to address the challenges faced by unmanned aerial vehicles (UAVs) in object detection under adverse weather conditions, especially in hazy weather. Specifically, the paper proposes solutions to the following problems: 1. **Lack of object - detection benchmark datasets suitable for the UAV perspective**: Most of the existing datasets mainly focus on ideal, sunny weather conditions and cannot fully reflect the actual application scenarios of UAVs in adverse weather such as haze. 2. **Specialties under the UAV perspective**: - **Scale variation**: The size of objects in images captured by UAVs can vary significantly due to changes in altitude and angle, resulting in an increase in the proportion of small objects. - **Uneven distribution**: The distribution of objects in UAV images is irregular, which is different from the concentrated distribution under the normal perspective. 3. **The impact of adverse weather conditions on image quality**: - **Image degradation**: Hazy weather can reduce the atmospheric transmission efficiency, leading to a decrease in image visibility and color distortion, which affects subsequent vision - based perception tasks. - **Domain gap**: Image degradation caused by weather can affect feature recognition, making it difficult for neural networks to process blurred and semantically ambiguous features, thus resulting in a large domain gap. To solve these problems, the author proposes the following solutions: - **HazyDet dataset**: A large - scale dataset has been constructed, containing 383,000 real - world instances, covering normal scenes in natural hazy environments and with synthetic hazy effects. This dataset fills the resource gap for UAVs in object detection under adverse weather. - **Depth - information - guided detection framework (DeCoDet)**: A new detection framework has been introduced, which uses depth information to improve the detection performance of UAVs in hazy conditions. This framework effectively copes with the challenges brought by the UAV perspective and hazy environment by dynamically adjusting the detection strategy based on depth cues. - **Benchmark testing and leaderboard**: A comprehensive quantitative and qualitative evaluation of the state - of - the - art detection and defogging methods has been carried out using the HazyDet dataset, and a benchmark and leaderboard have been established, providing a platform for the research community to understand the limitations of existing methods and develop robust solutions. ### Formula representation The formulas involved in the paper are as follows: - The classic formula for hazy image generation: \[ I(x, y) = J(x, y)t(x, y)+A(1 - t(x, y)) \] where \(I(x, y)\) is the observed hazy image, \(J(x, y)\) is the recoverable scene radiance, \(A\) represents the global atmospheric light, and \(t(x, y)\) is the transmission matrix, defined as: \[ t(x, y)=e^{-\beta d(x, y)} \] In this formula, \(\beta\) represents the atmospheric scattering coefficient, and \(d(x, y)\) represents the relative distance between the scene object and the camera. These formulas are used to simulate and generate realistic hazy images to support the development and evaluation of algorithms.

HazyDet: Open-source Benchmark for Drone-view Object Detection with Depth-cues in Hazy Scenes

DroneNet: Rescue Drone-View Object Detection

An Oriented Object Detector for Hazy Remote Sensing Images

Detection-Driven Exposure-Correction Network for Nighttime Drone-View Object Detection

CSSDet: small object detection via cross-scale feature enhancement on drone-view images

Detection-Friendly Dehazing: Object Detection in Real-World Hazy Scenes

Plug-and-Play Robust Aerial Object Detection Under Hazy Conditions

VisDrone-DET2018: The Vision Meets Drone Object Detection in Image Challenge Results

Hybrid receptive field network for small object detection on drone view

Depth-Guided Dehazing Network for Long-Range Aerial Scenes

PTCDet: advanced UAV imagery target detection

SparseDet: A Simple and Effective Framework for Fully Sparse LiDAR-Based 3-D Object Detection

DALDet: Depth-Aware Learning Based Object Detection for Autonomous Driving

Photo realistic synthetic dataset and multi-scale attention dehazing network

Multi-Task Learning for UAV Aerial Object Detection in Foggy Weather Condition

Data-driven visibility enhancement using multi-camera system

SimHaze: game engine simulated data for real-world dehazing

VisDrone-DET2021: The Vision Meets Drone Object detection Challenge Results

An Adaptive Multimodal Fusion 3D Object Detection Algorithm for Unmanned Systems in Adverse Weather

Dense Haze: A benchmark for image dehazing with dense-haze and haze-free images

Oriented Object Detection via Contextual Dependence Mining and Penalty-Incentive Allocation