Abstract:As remote sensing technology has advanced, the use of satellites and similar technologies has become increasingly prevalent in daily life. Now, it plays a crucial role in hydrology, agriculture, and geography. Nevertheless, because of the distinct qualities of remote sensing, including expansive scenes and small, densely packed targets, there are many challenges in detecting remote sensing objects. Those challenges lead to insufficient accuracy in remote sensing object detection. Consequently, developing a new model is essential to enhance the identification capabilities for objects in remote sensing imagery. To solve these constraints, we have designed the OD-YOLO approach that uses multi-scale feature fusion to improve the performance of the YOLOv8n model in small target detection. Firstly, traditional convolutions have poor recognition capabilities for certain geometric shapes. Therefore, in this paper, we introduce the Detection Refinement Module (DRmodule) into the backbone architecture. This module utilizes Deformable Convolutional Networks and the Hybrid Attention Transformer to strengthen the model's capability for feature extraction from geometric shapes and blurred objects effectively. Meanwhile, based on the Feature Pyramid Network of YOLO, at the head of the model framework, this paper enhances the detection capability by introducing a Dynamic Head to strengthen the fusion of different scales features in the feature pyramid. Additionally, to address the issue of detecting small objects in remote sensing images, this paper specifically designs the OIoU loss function to finely describe the difference between the detection box and the true box, further enhancing model performance. Experiments on the VisDrone dataset show that OD-YOLO surpasses the compared models by at least 5.2% in mAP50 and 4.4% in mAP75, and experiments on the Foggy Cityscapes dataset demonstrated that OD-YOLO improved mAP by 6.5%, demonstrating outstanding results in tasks related to remote sensing images and adverse weather object detection. This work not only advances the research in remote sensing image analysis, but also provides effective technical support for the practical deployment of future remote sensing applications.

YOLO-Former: Marrying YOLO and Transformer for Foreign Object Detection

An Object Detection Method Based on Improved YOLOX

YOLO-Former: YOLO Shakes Hand With ViT

A YOLO-NL object detector for real-time detection

ViT-YOLO:Transformer-Based YOLO for Object Detection

YOLO_SRv2: An evolved version of YOLO_SR

YOLO-World: Real-Time Open-Vocabulary Object Detection

End-to-End Object Detection with YOLOF

YOLO SRv2: an Evolved Version of YOLO SR

YOLO-SDH: improved YOLOv5 using scaled decoupled head for object detection

GL-YOLO-Lite: A Novel Lightweight Fallen Person Detection Model

OD-YOLO: Robust Small Object Detection Model in Remote Sensing Image with a Novel Multi-Scale Feature Fusion

Making You Only Look Once Faster: Toward Real-Time Intelligent Transportation Detection

Multi-Module Model Refinement for Real-Time Object Detection

FFCA-YOLO for Small Object Detection in Remote Sensing Images

YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors

YOLO‐RSFM: An efficient road small object detection method

MSFT-YOLO: Improved YOLOv5 Based on Transformer for Detecting Defects of Steel Surface

YOLOv6: A Single-Stage Object Detection Framework for Industrial Applications

YOLO-Extract: Improved YOLOv5 for Aircraft Object Detection in Remote Sensing Images