GODANet: an object detection model for remote sensing images fusing contextual information and dynamic convolution

Xing Rong,Zhihua Zhang,Hao Yuan,Shaobin Zhang
DOI: https://doi.org/10.1117/1.jrs.18.016507
IF: 1.568
2024-02-28
Journal of Applied Remote Sensing
Abstract:Remote sensing images are characterized by complex feature backgrounds and large target scale differences, so object detection for remote sensing images is a challenging problem. This work proposes a one-stage structure remote sensing image object detection model called GODANet. First, the GODANet incorporates a Global Context Network (GCNet) in the feature extraction structure. The GCNet focuses the model on the image region of interest from a global perspective. Second, the output layer utilizes an omni-dimensional dynamic convolution technique, allowing for more flexible adaptation to targets or edges in specific regions. Finally, an adaptive spatial feature fusion structure, IR-ASFF, which fuses improved-RFB (IRFB) modules is proposed to fuse the critical information of multiple levels of features to realize the adaptability to object detection at different scales. The GODANet efficiently aggregates network performance and possesses two main advantages: adaptability to multi-scale targets and focus on features of interest. The mean average precision (mAP) on the DIOR dataset and the NWPU VHR-10 dataset reached 93.7% and 92.9%, respectively, and compared with YOLOv7, the mAP was improved by 3.1% and 0.3%, respectively. Therefore, we believe the GODANet suits remote sensing image detection tasks.
environmental sciences,imaging science & photographic technology,remote sensing
What problem does this paper attempt to address?