Spatial-Temporal Feature Representation Learning for Facial Fatigue Detection.

Changyuan Wang,Ting Yan,Hongbo Jia
DOI: https://doi.org/10.1142/s0218001418560189
IF: 1.261
2018-01-01
International Journal of Pattern Recognition and Artificial Intelligence
Abstract:In order to reduce the serious problems caused by the operators’ fatigue, we propose a novel network model Convolutional Neural Network and Long Short-Term Memory Network (CNN-LSTM) — for fatigue detection in the inter-frame images of video sequences, which mainly consists of CNN and LSTM network. Firstly, in order to improve the accuracy of the deep network structure, the Viola–Jones detection algorithm and the Kernelized Correlation Filter (KCF) tracking algorithm are used in the face detection to normalize the size of the inter-frame images of video sequences. Secondly, we use the CNN and the LSTM network to detect the fatigue state in real time and efficiently. The fatigue-related facial features are extracted by the CNN. Then, the temporal symptoms of the whole fatigue process can be extracted by LSTM networks, the input data which is the facial feature vector can be obtained by the CNN. Thirdly, we train and test the network in a step-by-step approach. Finally, we experiment with the proposed network model. The experimental results demonstrate that the network structure can effectively detect the fatigue state, and the overall accuracy rate can rise to 82.8%.
What problem does this paper attempt to address?