MSTN: Multistage Spatial-Temporal Network for Driver Drowsiness Detection

Tun-Huai Shih,Chiou-Ting Hsu
DOI: https://doi.org/10.1007/978-3-319-54526-4_11
2017-01-01
Abstract:Recent survey has shown that drowsy driving is one of the main factors in fatal motor vehicle crashes. In this paper, given only the visual information of the driver, we propose a Multistage Spatial-Temporal Network (MSTN) to efficiently and accurately detect driver drowsiness. The proposed MSTN consists of a spatial CNN, a temporal LSTM, and then followed by a temporal smoothing. Firstly, we use the spatial CNN to effectively extract drowsiness-related features from the face region detected from each video frame. Then, we model the temporal variation of the drowsiness status by feeding a sequence of frame-level features into the Long Short Term Memory (LSTM). Finally, we conduct the temporal smoothing to smooth the predicted drowsiness scores in order to avoid noisy predictions. We evaluate the proposed MSTN using NTHU Drowsy Driver Detection Video Dataset and achieve 82.61% overall accuracy on the testing set.
What problem does this paper attempt to address?