Abstract:In recent years, YOLO object detection models have undergone significant advancement due to the success of novel deep convolutional networks. The success of these YOLO models is often attributed to their use of guidance techniques, such as expertly tailored deeper backbone and meticulously crafted detector head, which provides effective mechanisms to tradeoff between accuracy and efficiency. However, these sluggish-reasoning models are not capable of handling false detection and negative phenomena, facing challenges include improving the robustness of scaled objects detection against occlude and densely sophisticated scenarios. To address these limitations, we propose a novel object detector, You Only Look Once and None Left (YOLO-NL). Our model includes a novel global dynamic label assignment strategy, which allocates labels for specific anchors to maintain a balance between higher precision detection and finer localization. To enhance the detection capability of multi-scale objects in complex scenes, we separately upgrade CSPNet and PANet using the shortest-longest gradient strategy and self-attention mechanism. To meet the need for fast inference, we propose the Rep-CSPNet network using the reparameterization method to convert residual convolutions to ghost linear operations. Additionally, we accelerate the feature extraction process by deploying the serial SSPP structure. The proposed model is robust to scale objects against negative effectives such as dust, dense, ambiguous, and obstructed scenes. YOLO-NL achieved a mAP of 52.9% on the COCO 2017 test dataset, exhibiting a significant improvement of 2.64% compared to the baseline YOLOX. It is worth noting that YOLO-NL can perform high-accuracy and high-speed face mask detection in real-life scenarios. The YOLO-NL model was employed on self-built FMD and large open-source datasets, and the results show that it outperforms the other state-of-the-art methods, achieving 98.8% accuracy while maintaining 130 FPS.

RFA-YOLO-POSE: A Fusion Algorithm for Pose Detection and Object Identification Amidst Complex Crowds

Pedestrian Detection Method Based on Improved YOLOv5s for Densely Occluded Scenarios

YOLOv8-PoseBoost: Advancements in Multimodal Robot Pose Keypoint Detection

A YOLOX Object Detection Algorithm Based on Bidirectional Cross-scale Path Aggregation

Research on Human Posture Estimation Algorithm Based on YOLO-Pose

RS-YOLO: A YOLO-Based Method for Small Object Detection in Remote Sensing

Object detection in crowded scenes via joint prediction

FA-YOLO: Research On Efficient Feature Selection YOLO Improved Algorithm Based On FMDS and AGMF Modules

Small-Scale Pedestrian Detection Using Fusion Network and Probabilistic Loss

HF-YOLO: Advanced Pedestrian Detection Model with Feature Fusion and Imbalance Resolution

An Improved Pedestrian Detection Model Based on YOLOv8 for Dense Scenes

YOLO-Rlepose: Improved YOLO Based on Swin Transformer and Rle-Oks Loss for Multi-Person Pose Estimation

MDA-YOLO Person: a 2D human pose estimation model based on YOLO detection framework

SYOLO: an Efficient Pedestrian Detection

A YOLO-NL object detector for real-time detection

MAF-YOLO: Multi-modal attention fusion based YOLO for pedestrian detection

MC-YOLOv5: A Multi-Class Small Object Detection Algorithm

Multi-scale feature fusion with attention mechanism for crowded road object detection

YOLOv8-based Dense Pedestrian Detection Algorithm

KSL-POSE: A Real-Time 2D Human Pose Estimation Method Based on Modified YOLOv8-Pose Framework

YOLO-ABD: A Multi-Scale Detection Model for Pedestrian Anomaly Behavior Detection