MCX-YOLOv5: efficient helmet detection in complex power warehouse scenarios

Hongchao Xu,Zhenyu Wu
DOI: https://doi.org/10.1007/s11554-023-01406-4
IF: 2.293
2024-02-01
Journal of Real-Time Image Processing
Abstract:The accuracy of the original YOLOv5 algorithm in detecting whether power company employees are wearing helmets is low due to the complex monitoring scenarios in the power warehouse and the small size of the helmets. As a result, it cannot be applied to actual operations. To address this issue, we developed the MCX-YOLOv5 helmet detection algorithm. Our model utilizes the YOLOv5 architecture and integrates a Coordinate-Spatial Attention Module (CSAM) to effectively filter the spatiotemporal data of the feature inputs. Additionally, we implement a Multi-scale Asymmetric Convolutions (MAConv) downsampling module to improve the algorithm's sensitivity to feature scale variations. To address the challenge of task information cross-coupling in coupled heads, we propose a decoupled head that is less heavy than YOLOv6 as a substitute. Our enhanced model achieved a 2.7% rise in the mean Average Precision at 50 (mAP50) and a 4.9% improvement in mAP75 on our self-developed database through multiple experiments, with just a minimal increase in parameters. Our model has yielded significant performance improvements on the Kaggle open-source Hard Hat Workers Detection dataset (HHWD), the public Safety Helmet Wearing Dataset (SHWD), and the PASCAL Visual Object Classes (VOC) dataset. These results highlight the effectiveness of our proposed algorithm in achieving higher accuracy for safety helmet wear detection in storage scenarios.
computer science, artificial intelligence,engineering, electrical & electronic,imaging science & photographic technology
What problem does this paper attempt to address?