Lightweight Multimodal Feature Graph Convolutional Network for Dangerous Driving Behavior Detection
Wei Xing,Yao Shang,Zhao Chong,Hu Di,Luo Hui,Lu Yang
DOI: https://doi.org/10.1007/s11554-023-01277-9
IF: 2.293
2023-01-01
Journal of Real-Time Image Processing
Abstract:Real-time detection and identification of dangerous driving behaviors is an effective measure to reduce traffic accidents. Due to the high network delay, limited communication bandwidth, and weak computing power, lightweight detection models that can run on edge devices have been widely investigated and attracted considerable attention. In recent years, the Graph Convolutional Network (GCN), which models the human skeleton as a spatiotemporal graph, has achieved remarkable performance, due to its powerful capability of modeling non-Euclidean structured data. However, there are disadvantages such as the unitary way of extracting information, high model complexity, and inability to integrate environmental information. Therefore, we design a Lightweight Multimodal Feature Graph Convolutional Network (L-MFGCN) model for dangerous driving behavior detection video in an end-to-end manner. First, we propose a Multimodal Feature Graph Convolutional Neural Network (MF-GCN), which captures richer features by extracting critical local spatial and temporal information of joint points, and a multi-information fusion behavior recognition model of “people + objects” by capturing the motion information of related object. Then, the method based on Singular Value Decomposition (SVD) rank reduction is used to compress the model to improve the speed of recognizing an action sample while ensuring sufficient detection accuracy. The proposed model, respectively, achieves 96% and 86.3% accuracy on the x-view benchmark of NTU-RGBD dataset and the homemade Locomotive Driver Dataset, which attains the state-of-the-art performance.