A Multiscale Information Fusion Network Based on PixelShuffle Integrated With YOLO for Aerial Remote Sensing Object Detection

Li Hu Xi,Jing Wei Hou,Guang Lin Ma,Yong Qiang Hei,Wen Tao Li
DOI: https://doi.org/10.1109/lgrs.2024.3353304
IF: 5.343
2024-02-07
IEEE Geoscience and Remote Sensing Letters
Abstract:Deep learning (DL)-based object detection has made tremendous progress in the detection of aerial remote sensing targets. However, the issue of similar targets and multiscale targets still becomes an obstacle in improving the detection accuracy. To address this issue, a multiscale information fusion network based on PixelShuffle integrated with YOLO (MPS-YOLO) is proposed. First, to reduce the loss of deep semantic feature information of similar targets in the process of feature fusion, the feature pyramid network based on PixelShuffle (FPN-P) is introduced. Second, aiming at the phenomenon that gets stuck in identifying multiscale targets, a multiscale receptive field (MRF) module is designed to fuse the multiscale information of the feature layer. Finally, to further enhance the detection result, an extra shallow feature (ESF) map is brought in to enrich the context information. Numerical results in public aerial remote sensing datasets show that the proposed algorithm enhances the detection accuracy by 4.15% and has preeminent robustness to difficult-to-identify targets.
imaging science & photographic technology,remote sensing,engineering, electrical & electronic,geochemistry & geophysics
What problem does this paper attempt to address?