Abstract: Targets in remote sensing images are small and densely distributed. To address this, this paper adds a detection head, P2, dedicated to small-scale targets on top of the three detection layers of the original YOLOv5 model, and feeds the shallow high-resolution feature map into the subsequent multi-scale feature fusion module. This avoids losing key feature information of small-scale targets during repeated downsampling. First, an enhanced multi-scale feature fusion pyramid network, DSI-FPN, is designed. The FPN+PAN network is optimized with depthwise separable convolution and Involution operators, which require fewer parameters and computations, together with a spatial attention mechanism, to generate more informative feature maps for the detection task. Second, we propose an adaptive channel-spatial attention mechanism, SCBAM, which introduces self-attention into the CBAM module. This adds non-local information to an interaction that originally used only local information, overcomes the limit imposed by the convolution kernel size, enlarges the model's receptive field, and improves its feature representation ability. Third, to address the limited computing power of the devices on which the detector is deployed, we propose a joint-teacher knowledge distillation framework based on the feature layer. A teacher distillation loss is designed, and the student network's online learning is adjusted dynamically by balancing the contributions of the teacher network and the ground truth. This noticeably improves the detection accuracy of the student network while reducing its parameter count and model size. Finally, comparison with other remote sensing object detection methods shows that the proposed approach detects small-scale targets in remote sensing images better under different lighting conditions. The detection accuracy reaches 43.9%, which is 7.4% higher than that of the original model. After knowledge distillation, the model parameters are reduced to 1/3 of the original, and the detection accuracy is 40.2%.
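
The claim that depthwise separable convolution needs fewer parameters than a standard convolution can be checked with simple arithmetic. The sketch below is only an illustration: the channel counts and kernel size are hypothetical values chosen for the example, bias terms are ignored, and it does not reproduce the actual DSI-FPN configuration.

```python
def standard_conv_params(c_in: int, c_out: int, k: int) -> int:
    """Weights in a standard k x k convolution (bias ignored)."""
    return k * k * c_in * c_out


def depthwise_separable_params(c_in: int, c_out: int, k: int) -> int:
    """Weights in a depthwise k x k conv followed by a 1 x 1 pointwise conv."""
    depthwise = k * k * c_in   # one k x k filter per input channel
    pointwise = c_in * c_out   # 1 x 1 conv that mixes channels
    return depthwise + pointwise


# Hypothetical layer sizes, assumed for illustration only.
c_in, c_out, k = 128, 256, 3

std = standard_conv_params(c_in, c_out, k)
dws = depthwise_separable_params(c_in, c_out, k)
ratio = dws / std  # equals 1/c_out + 1/k**2

print(f"standard: {std}, depthwise separable: {dws}, ratio: {ratio:.3f}")
```

For a 3 x 3 kernel the ratio is close to 1/9, so the saving comes mostly from the kernel size and grows only slightly with the number of output channels.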

An Improved YOLOv5 Model Based on Feature Fusion and Attention Mechanism for Multi-Scale Satellite Recognition

Improvement of Yolov5 Target Detection Algorithm Combined with Multi-Scale Feature Fusion

Target detection based on multi-scale feature fusion and cross-channel interactive attention mechanism

An improved YOLOv5 method for large objects detection with multi-scale feature cross-layer fusion network

Target Detection of Remote Sensing Image Based on an Improved YOLOv5

Improved YOLO-V3 with DenseNet for Multi-Scale Remote Sensing Target Detection

Approach for improving YOLOv5 network with application to remote sensing target detection

Object Detection in Aerial Remote Sensing Images with Multi-scale Feature Enhancement

FE-YOLOv5: Improved YOLOv5 Network for Multi-scale Drone-Captured Scene Detection

An Aerial Image Detection Algorithm Based on Improved YOLOv5

Remote Sensing Image Target Detection and Recognition Based on YOLOv5

SAR Image Aircraft Target Recognition Based on Improved YOLOv5

Aerial images object detection method based on cross-scale multi-feature fusion

Improved Lightweight YOLOv5 Using Attention Mechanism for Satellite Components Recognition

Target Detection Method of UAV Aerial Imagery Based on Improved YOLOv5

An Improved YOLOv8 Detector for Multi-Scale Target Detection in Remote Sensing Images

An Improved YOLOv5 Method for Small Object Detection in UAV Capture Scenes

Improved remote sensing image target detection based on YOLOv7

Adaptively Attentional Feature Fusion Oriented to Multiscale Object Detection in Remote Sensing Images

Object Detection of Remote Sensing Image Based on Multi-Scale Feature Fusion and Attention Mechanism