GAB-Net: A Robust Detector for Remote Sensing Object Detection Under Dramatic Scale Variation and Complex Backgrounds

Hongyu Zhang,Yunbo Rao,Jie Shao,Fanman Meng,Naveed Ahmad
DOI: https://doi.org/10.1109/lgrs.2023.3325410
IF: 5.343
2023-11-01
IEEE Geoscience and Remote Sensing Letters
Abstract:Detecting objects in remote sensing images (RSIs), characterized by dramatic scale variation and complex backgrounds, has always been a challenging problem. These challenges can be further summarized into three aspects: 1) scale variation among objects; 2) feature fusion misalignment due to the semantic gap between adjacent feature layers and noise from backgrounds; and 3) boundary uncertainty under ambiguous and complex backgrounds. To alleviate these problems, we first utilize a global–local feature enhancement module (GLFEM) to capture local features with multiple receptive fields through cheap pooling operation and obtain global features through nonlocal block, thus alleviating the scale variation issues. Subsequently, attentional feature fusion alignment (AFFA) module is designed to align adjacent feature levels in the feature pyramid from pixel and channel levels. Finally, boundary-uncertainty aware head (BUAH) with distribution focal loss (DFL) is adopted to solve the boundary uncertainty problems. After fusing GLFEM, AFFA, and BUAH modules, we obtain GAB-Net. GAB-Net outperforms state-of-the-art methods on the Dior and NWPU VHR-10 datasets, achieving mAP scores of 73.8% and 89.8%, respectively, without adding high computational costs. The code is available at: https://github.com/Hong-yu-Zhang/GAB-Net.
imaging science & photographic technology,remote sensing,engineering, electrical & electronic,geochemistry & geophysics
What problem does this paper attempt to address?