AFANet: A Multibackbone Compatible Feature Fusion Framework for Effective Remote Sensing Object Detection

Qingming Yi,Mingfeng Zheng,Min Shi,Jian Weng,Aiwen Luo
DOI: https://doi.org/10.1109/lgrs.2024.3462089
IF: 5.343
2024-10-01
IEEE Geoscience and Remote Sensing Letters
Abstract:Remote sensing object detection (RSOD) using convolutional neural networks (CNNs) continues to pose challenges in achieving high detection accuracy due to the inherent complexity of remote sensing images, characterized by intricate backgrounds, massive multiscale objects with irregular shapes, and significant variations. In addition, existing RSOD methods often rely on a particular backbone architecture, hindering their adaptability to achieve high accuracy across diverse networks with varying backbones. To address these challenges, we propose a novel multibackbone compatible feature fusion framework termed attention-aware feature aggregation network (AFANet). First, a multibranch attention-based semantic aggregation (MASA) module is introduced to adaptively capture the high-level semantic information. Second, the multiscale spatial features are integrated with the semantic information using a self-attention-guided global contextual feature fusion (SGCFF) strategy. Finally, we incorporate a dual-attention mechanism to capture more fine-grained features to detect small objects. Extensive experiments on the DIOR and NWPU VHR-10 datasets demonstrate the effectiveness of the proposed AFANet across various backbones, achieving superior detection accuracy. The code is available at https://github.com/lawlawCodes/AFANet.
imaging science & photographic technology,remote sensing,engineering, electrical & electronic,geochemistry & geophysics
What problem does this paper attempt to address?