Object Detection in Remote Sensing Images With Parallel Feature Fusion and Cascade Global Attention Head

Zhigang Yang,Yiming Liu,Guiwei Wen,Xiangyu Xia,Wei Emma Zhang,Tao Chen
DOI: https://doi.org/10.1109/lgrs.2024.3385231
IF: 5.343
2024-04-16
IEEE Geoscience and Remote Sensing Letters
Abstract:Convolutional neural networks (CNNs) have driven significant development in remote sensing (RS) object detection. To achieve concise and effective optimization, we propose a two-stage detector with a parallel feature fusion strategy and a cascade global attention (GA) mechanism for object detection in RS images, named PC-RCNN. We first design a feature pyramid network with two parallel branches (PB-FPN), corresponding to the top-down and bottom-up feature fusion pathways, respectively. Different optimization modules can be adopted in different pathways to avoid potential module incompatibility when connected in series. Such parallel feature fusion strategy can achieve both higher detection accuracy and higher computational efficiency compared with previous series fusion modes. Furthermore, we design a GA block to enhance feature representations of regions and propose a cascade GA head network (CGA-Head) for accurate category prediction and location estimation. Experiments on a challenging large-scale dataset, namely DOTA, show that the proposed PC-RCNN achieves a mean average precision (mAP) of 77.63%, which is comparable to other state-of-the-art CNN-based models. Parallel feature fusion, cascade GA, object detection, RS images.
imaging science & photographic technology,remote sensing,engineering, electrical & electronic,geochemistry & geophysics
What problem does this paper attempt to address?