Deep Convolutional Feature Enhancement for Remote Sensing Object Detection
Zhigang Yang,Yiming Liu,Zehao Gao,Guiwei Wen,Wei Emma Zhang,Yihan Xiao
DOI: https://doi.org/10.1109/lgrs.2024.3484652
IF: 5.343
2024-11-05
IEEE Geoscience and Remote Sensing Letters
Abstract:Convolutional neural networks (CNNs) have witnessed the flourishing development of remote sensing object detection. For CNN-based detectors, obtaining high-quality deep convolutional features is the foundation for improving the overall detection performance due to the challenging characteristics of remote sensing images, such as large variations in object shape, size, and orientation. In this letter, we present a channel-assisted deformable convolution (CDC) backbone and a bidirectional FPN with concise cross-layer connections (CFPN) for effective feature enhancement, which can significantly improve the detection accuracy with an acceptable complexity increment. In the proposed backbone, CDC modules are embedded into the top two layers of ResNet, where deformable convolution (DC) can guide the network to adapt to the various shapes of objects and channel attention (CA) can address the limitation that DC only focuses on the spatial dimensions while ignores the deep channel information. Our CFPN is a simple yet effective strategy to solve the problem that a complex feature fusion structure leads to a decrement in detection accuracy when cooperating with our enhanced backbone. Qualitative and quantitative experiments on a widely recognized benchmark, namely, DOTA, demonstrate that our method achieves a competitive performance against other advanced models.
imaging science & photographic technology,remote sensing,engineering, electrical & electronic,geochemistry & geophysics