YOLO-PDNet: Small Target Recognition Improvement for Remote Sensing Image Based on YOLOv8

XiaoDong Liu,Hao Zhang,Wenyin Gong,Xiang Li
DOI: https://doi.org/10.1109/ijcnn60899.2024.10650721
2024-01-01
Abstract:RSI-based (remote sensing image) target detection holds a significant position in the field within the domain of visual computing. However, recognizing small target objects within RSI poses a significant hurdle due to their often-elusive nature. To confront this challenge, the paper introduces a novel neural network YOLO-PDNet. This network is an extension of YOLOv8, integrating a rapid neural architecture, proficient convolution operator, and a loss function based on normalized Wasserstein distance. This novel strategy seeks to improve the precision and effectiveness of target detection in remote sensing image. First, in order to boost the inference speed and minimize parameter load, we introduce a partial and effective convolution module (CPF) and enhanced SPPFCSPC module. Second, the Convolutional Depthwise Efficient Feature (CDEF) is designed to enhance feature extraction ability. Simultaneously, it ensures shorter inference time by utilizing separate convolutions. Finally, to alleviate the strong sensitivity of joint intersection (IoU)-based metrics to slight positional deviations of tiny objects, we apply the NWDL (normalized Wasserstein distance loss) loss function. Thorough comparisons and analysis showcase the heightened detection performance of our YOLO-PDNet algorithm, surpassing contemporary models in terms of detection precision. YOLO-PDNet has notably achieved an impressive 69.6% mAP on the DOTA v1.5 dataset and an outstanding 93.6% mAP on the NWPUVHR-10 dataset, surpassing YOLOv8 by an average of more than 1%. This significant enhancement in performance underscores the efficacy of the innovative YOLO-PDNet architecture in identifying targets within images captured using advanced remote sensing technology with high spatial detail.
What problem does this paper attempt to address?