Driver Abnormal Expression Detection Method Based on Improved Lightweight YOLOv5

Keming Yao,Zhongzhou Wang,Fuao Guo,Feng Li
DOI: https://doi.org/10.3390/electronics13061138
IF: 2.9
2024-03-21
Electronics
Abstract:The rapid advancement of intelligent assisted driving technology has significantly enhanced transportation convenience in society and contributed to the mitigation of traffic safety hazards. Addressing the potential for drivers to experience abnormal physical conditions during the driving process, an enhanced lightweight network model based on YOLOv5 for detecting abnormal facial expressions of drivers is proposed in this paper. Initially, the lightweighting of the YOLOv5 backbone network is achieved by integrating the FasterNet Block, a lightweight module from the FasterNet network, with the C3 module in the main network. This combination forms the C3-faster module. Subsequently, the original convolutional modules in the YOLOv5 model are replaced with the improved GSConvns module to reduce computational load. Building upon the GSConvns module, the VoV-GSCSP module is constructed to ensure the lightweighting of the neck network while maintaining detection accuracy. Finally, channel pruning and fine-tuning operations are applied to the entire model. Channel pruning involves removing channels with minimal impact on output results, further reducing the model's computational load, parameters, and size. The fine-tuning operation compensates for any potential loss in detection accuracy. Experimental results demonstrate that the proposed model achieves a substantial reduction in both parameter count and computational load while maintaining a high detection accuracy of 84.5%. The improved model has a compact size of only 4.6 MB, making it more conducive to the efficient operation of onboard computers.
engineering, electrical & electronic,computer science, information systems,physics, applied
What problem does this paper attempt to address?
### Problems Addressed by the Paper This paper aims to address the issue of detecting abnormal facial expressions of drivers when they experience unusual physical conditions while driving. Specifically, the study proposes a method based on an improved lightweight YOLOv5 network to detect abnormal facial expressions of drivers. #### Main Contributions 1. **Dataset Construction**: To address the lack of facial expression data related to pain and distress in current public datasets, the researchers created their own dataset, mainly including three common facial expression categories of drivers during driving: happy, neutral, and pain. In the driving context, happy and pain expressions may affect certain driving decisions, and in this paper, these two expressions are classified as abnormal driving expressions. 2. **Model Lightweight Improvement**: The backbone network of YOLOv5 was lightweighted, specifically by replacing the C3 module in the backbone network with the C3-faster module and replacing certain convolution modules in the YOLOv5 network with the improved GSConvns lightweight module. Additionally, the neck network was lightweighted using the VoV-GSCSP module to reduce the overall model's parameter count, computational load, and model size while ensuring the model maintains high detection accuracy. 3. **Model Optimization**: The improved network model underwent channel pruning and fine-tuning operations, further reducing the model's parameter count, computational load, and model size. Fine-tuning compensated for potential performance loss during pruning, enabling the model to meet the detection requirements in actual driving environments while maintaining high detection accuracy. Through these improvements, the proposed model significantly reduces the number of parameters and computational load while maintaining a high detection accuracy of 84.5%. The model size is only 4.6 MB, making it more suitable for efficient operation on in-vehicle computers.