Better YOLO with Attention-Augmented Network and Enhanced Generalization Performance for Safety Helmet Detection

Shuqi Shen,Junjie Yang
2024-05-04
Abstract:Safety helmets play a crucial role in protecting workers from head injuries in construction sites, where potential hazards are prevalent. However, currently, there is no approach that can simultaneously achieve both model accuracy and performance in complex environments. In this study, we utilized a Yolo-based model for safety helmet detection, achieved a 2% improvement in mAP (mean Average Precision) performance while reducing parameters and Flops count by over 25%. YOLO(You Only Look Once) is a widely used, high-performance, lightweight model architecture that is well suited for complex environments. We presents a novel approach by incorporating a lightweight feature extraction network backbone based on GhostNetv2, integrating attention modules such as Spatial Channel-wise Attention Net(SCNet) and Coordination Attention Net(CANet), and adopting the Gradient Norm Aware optimizer (GAM) for improved generalization ability. In safety-critical environments, the accurate detection and speed of safety helmets plays a pivotal role in preventing occupational hazards and ensuring compliance with safety protocols. This work addresses the pressing need for robust and efficient helmet detection methods, offering a comprehensive framework that not only enhances accuracy but also improves the adaptability of detection models to real-world conditions. Our experimental results underscore the synergistic effects of GhostNetv2, attention modules, and the GAM optimizer, presenting a compelling solution for safety helmet detection that achieves superior performance in terms of accuracy, generalization, and efficiency.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper mainly tackles the accuracy and efficiency issues of safety helmet detection in complex environments, such as construction sites. The existing methods have shortcomings in model accuracy and performance. The researchers propose an improved YOLO (You Only Look Once) model by integrating the lightweight feature extraction network GhostNetv2, attention modules (such as Spatial Channel-wise Attention Net and Coordination Attention Net), and Gradient Norm Aware optimizer. This improves the model's generalization ability and reduces parameters and computation. This method aims to detect safety helmets accurately and quickly to prevent workplace hazards and ensure compliance with safety regulations. Experimental results show that this method improves the model's adaptability to real-world environments while maintaining high accuracy.