Abstract:In recent years, YOLO object detection models have undergone significant advancement due to the success of novel deep convolutional networks. The success of these YOLO models is often attributed to their use of guidance techniques, such as expertly tailored deeper backbone and meticulously crafted detector head, which provides effective mechanisms to tradeoff between accuracy and efficiency. However, these sluggish-reasoning models are not capable of handling false detection and negative phenomena, facing challenges include improving the robustness of scaled objects detection against occlude and densely sophisticated scenarios. To address these limitations, we propose a novel object detector, You Only Look Once and None Left (YOLO-NL). Our model includes a novel global dynamic label assignment strategy, which allocates labels for specific anchors to maintain a balance between higher precision detection and finer localization. To enhance the detection capability of multi-scale objects in complex scenes, we separately upgrade CSPNet and PANet using the shortest-longest gradient strategy and self-attention mechanism. To meet the need for fast inference, we propose the Rep-CSPNet network using the reparameterization method to convert residual convolutions to ghost linear operations. Additionally, we accelerate the feature extraction process by deploying the serial SSPP structure. The proposed model is robust to scale objects against negative effectives such as dust, dense, ambiguous, and obstructed scenes. YOLO-NL achieved a mAP of 52.9% on the COCO 2017 test dataset, exhibiting a significant improvement of 2.64% compared to the baseline YOLOX. It is worth noting that YOLO-NL can perform high-accuracy and high-speed face mask detection in real-life scenarios. The YOLO-NL model was employed on self-built FMD and large open-source datasets, and the results show that it outperforms the other state-of-the-art methods, achieving 98.8% accuracy while maintaining 130 FPS.

YOLO-MS: Rethinking Multi-Scale Representation Learning for Real-time Object Detection.

A Lightweight SE-YOLOv3 Network for Multi-Scale Object Detection in Remote Sensing Imagery.

MS-YOLO: integration-based multi-subnets neural network for object detection in aerial images

Multi-Module Model Refinement for Real-Time Object Detection

MSF-YOLO: A multi-scale features fusion-based method for small object detection

YOLOv10: Real-Time End-to-End Object Detection

MSYOLOF: Multi-input-single-output Encoder Network with Tripartite Feature Enhancement for Object Detection.

M2YOLOF: Based on effective receptive fields and multiple-in-single-out encoder for object detection

MS-YOLOv7:YOLOv7 Based on Multi-Scale for Object Detection on UAV Aerial Photography

End-to-End Object Detection with YOLOF

MSA-YOLO: A Remote Sensing Object Detection Model Based on Multi-Scale Strip Attention.

A YOLO-NL object detector for real-time detection

SuperYOLO: Super Resolution Assisted Object Detection in Multimodal Remote Sensing Imagery

YOLOH: You Only Look One Hourglass for Real-Time Object Detection

An improved YOLOv8 algorithm for small object detection in autonomous driving

Lite-YOLOv3: a real-time object detector based on multi-scale slice depthwise convolution and lightweight attention mechanism

GMS-YOLO: An Algorithm for Multi-Scale Object Detection in Complex Environments in Confined Compartments

YOLOv6: A Single-Stage Object Detection Framework for Industrial Applications

MC-YOLOv5: A Multi-Class Small Object Detection Algorithm

DMS-YOLOv5: A Decoupled Multi-Scale YOLOv5 Method for Small Object Detection

CF-YOLOX: An Autonomous Driving Detection Model for Multi-Scale Object Detection