Abstract:The rapid detection of distracted driving behaviors is crucial for enhancing road safety and preventing traffic accidents. Compared with the traditional methods of distracted-driving-behavior detection, the YOLOv8 model has been proven to possess powerful capabilities, enabling it to perceive global information more swiftly. Currently, the successful application of GhostConv in edge computing and embedded systems further validates the advantages of lightweight design in real-time detection using large models. Effectively integrating lightweight strategies into YOLOv8 models and reducing their impact on model performance has become a focal point in the field of real-time distracted driving detection based on deep learning. Inspired by GhostConv, this paper presents an innovative GhostC2f design, aiming to integrate the idea of linear transformation to generate more feature maps without additional computation into YOLOv8 for real-time distracted-driving-detection tasks. The goal is to reduce model parameters and computational load. Additionally, enhancements have been made to the path aggregation network (PAN) to amplify multi-level feature fusion and contextual information propagation. Furthermore, simple attention mechanisms (SimAMs) are introduced to perform self-normalization on each feature map, emphasizing feature maps with valuable information and suppressing redundant information interference in complex backgrounds. Lastly, the nine distinct distracted driving types in the publicly available SFDDD dataset were expanded to 14 categories, and nighttime scenarios were introduced. The results indicate a 5.1% improvement in model accuracy, with model weight size and computational load reduced by 36.7% and 34.6%, respectively. During 30 real vehicle tests, the distracted-driving-detection accuracy reached 91.9% during daylight and 90.3% at night, affirming the exceptional performance of the proposed model in assisting distracted driving detection when driving and contributing to accident-risk reduction.

A lightweight model combining convolutional neural network and Transformer for driver distraction recognition

Multimodal driver distraction detection using dual-channel network of CNN and Transformer

Model Lightweighting for Real-time Distraction Detection on Resource-Limited Devices

Toward Extremely Lightweight Distracted Driver Recognition With Distillation-Based Neural Architecture Search and Knowledge Transfer

Vehicle Behavior Recognition using Multi-Stream 3D Convolutional Neural Network

MCANet: Hierarchical cross-fusion lightweight transformer based on multi-ConvHead attention for object detection

TransConvNet: Perform perceptually relevant driver's visual attention predictions

A Lightweight Attention-Based Network towards Distracted Driving Behavior Recognition

Driver attention prediction based on convolution and transformers

Improving real-time driver distraction detection via constrained attention mechanism

L-TLA: A Lightweight Driver Distraction Detection Method Based on Three-Level Attention Mechanisms

A Traffic Sign Recognition System Based on Lightweight Network Learning

Driver Distraction Detection Using Octave-Like Convolutional Neural Network

Distracted Driver Detection Based on a CNN With Decreasing Filter Size

Real-time traffic sign detection network using DS-DetNet and lite fusion FPN

Lightweight Vision Transformer with Cross Feature Attention

Optimizing Road Safety: Advancements in Lightweight YOLOv8 Models and GhostC2f Design for Real-Time Distracted Driving Detection

Recongnition of Distracted Driving Behavior Based on Improved Bi-LSTM Model and Attention Mechanism

DSDFormer: An Innovative Transformer-Mamba Framework for Robust High-Precision Driver Distraction Identification

CSFNet: a compact and efficient convolution-transformer hybrid vision model