KLDet: Detecting Tiny Objects in Remote Sensing Images via Kullback–Leibler Divergence
Zhuangzhuang Zhou,Yingying Zhu
DOI: https://doi.org/10.1109/tgrs.2024.3382099
IF: 8.2
2024-04-09
IEEE Transactions on Geoscience and Remote Sensing
Abstract:Remote sensing images (RSIs) frequently contain quite a few tiny objects with a finite number of pixels to study. The limited spatial information poses a challenge for extracting discriminative features for representing the characteristics of tiny objects. Existing solutions mainly focus on aggregating contextual information at different levels, while rarely touching the step that is crucial for model training, i.e., label assignment. Tiny instances occupy fairly small regions of images and have limited overlaps to priors (anchors or dots), which is a dilemma for traditional label assignment strategies. Despite being simple and effective, the mainstream Intersection over Union (IoU)-based label assignment strategy struggles to accurately measure the localization of tiny bounding boxes. In contrast, the Kullback–Leibler divergence (KLD) localization metric accurately reflects minor offsets of tiny bounding boxes. More importantly, KLD is able to measure non-overlapping bounding boxes, providing an advantage in mining more potential positive samples of tiny objects. In this article, from a cost-efficient point of view, we detect tiny objects through KLD in the form of single-stage framework. Specifically, we model the parameterized bounding box as a 2-D Gaussian distribution (Bbox2Gaussian) in order to use KLD as a localization metric. Then, we propose an adaptive online training sample mining (Ali-TSM) strategy based on inter-distribution similarity, which selects high-quality positive samples by considering localization and classification rather than just centroid distance or IoU. Finally, task-level attention (TlA) is introduced to guide the model in freely selecting the appropriate features for the classification or regression task. We conducted extensive experiments on four popular public datasets. Compared to the baseline, KLDet improves performance on Tiny Object Detection in Aerial Images (AI-TOD) and object DetectIon in Optical Remote sensing image (DIOR) by 4.1 AP and 6.7 mAP. On VisDrone and Small Object Detection dAtasets (SODA-D), KLDet exhibits superior performance than baseline. The code is available at https://github.com/TinyOD/mmdet-kldet.
imaging science & photographic technology,remote sensing,engineering, electrical & electronic,geochemistry & geophysics