HR-YOLO: A Multi-Branch Network Model for Helmet Detection Combined with High-Resolution Network and YOLOv5

Yuanfeng Lian,Jing Li,Shaohua Dong,Xingtao Li
DOI: https://doi.org/10.3390/electronics13122271
IF: 2.9
2024-06-11
Electronics
Abstract:Automatic detection of safety helmet wearing is significant in ensuring safe production. However, the accuracy of safety helmet detection can be challenged by various factors, such as complex environments, poor lighting conditions and small-sized targets. This paper presents a novel and efficient deep learning framework named High-Resolution You Only Look Once (HR-YOLO) for safety helmet wearing detection. The proposed framework synthesizes safety helmet wearing information from the features of helmet objects and human pose. HR-YOLO can use features from two branches to make the bounding box of suppression predictions more accurate for small targets. Then, to further improve the iterative efficiency and accuracy of the model, we design an optimized residual network structure by using Optimized Powered Stochastic Gradient Descent (OP-SGD). Moreover, a Laplace-Aware Attention Model (LAAM) is designed to make the YOLOv5 decoder pay more attention to the feature information from human pose and suppress interference from irrelevant features, which enhances network representation. Finally, non-maximum suppression voting (PA-NMS voting) is proposed to improve detection accuracy for occluded targets, using pose information to constrain the confidence of bounding boxes and select optimal bounding boxes through a modified voting process. Experimental results demonstrate that the presented safety helmet detection network outperforms other approaches and has practical value in application scenarios. Compared with the other algorithms, the proposed algorithm improves the precision, recall and mAP by 7.27%, 5.46% and 7.3%, on average, respectively.
engineering, electrical & electronic,physics, applied,computer science, information systems
What problem does this paper attempt to address?
### Problems the Paper Aims to Solve The paper aims to address the accuracy issue of helmet-wearing detection in complex environments. Specifically, existing helmet detection methods face challenges in complex environments, poor lighting conditions, and small object detection. To solve these problems, the authors propose a new deep learning framework called High-Resolution You Only Look Once (HR-YOLO) for helmet-wearing detection. ### Main Contributions 1. **Multi-branch Feature Fusion**: HR-YOLO combines high-resolution networks and YOLOv5 to achieve high-quality feature fusion between helmet object detection and human pose estimation. 2. **Optimized Network Structure**: An optimized network structure, OP-SGD, is proposed to enhance network expression capability and accelerate convergence speed. 3. **Laplacian Attention Model (LAAM)**: A new self-attention model, LAAM, is designed to enhance feature extraction and make HR-YOLO more focused on pose features, thereby improving the detection accuracy of occluded and small objects. 4. **Improved Non-Maximum Suppression (PA-NMS Voting)**: The non-maximum suppression algorithm is improved by utilizing human pose information constraints, further enhancing localization accuracy. ### Experimental Results Experimental results show that HR-YOLO outperforms other methods in helmet-wearing detection tasks on the GDUT-HWD and SHWD datasets, improving precision, recall, and mean Average Precision (mAP). Specifically, on the GDUT-HWD dataset, HR-YOLO achieves higher detection precision across multiple categories compared to other methods; on the SHWD dataset, when the IoU threshold is set to 0.5, the detection accuracy reaches 96.1%, significantly outperforming other algorithms.