Fast Video Facial Expression Recognition by Deeply Tensor-Compressed LSTM Neural Network on Mobile Device.

Peining Zhen,Hai-Bao Chen,Yuan Cheng,Zhigang Ji,Bin Liu,Hao Yu
DOI: https://doi.org/10.1145/3318216.3363322
2021-01-01
ACM Transactions on Internet of Things
Abstract:Poster: Mobile devices usually suffer from limited computation and storage resource which seriously hinders them from deep neural network applications. In this paper, we introduce a deeply tensor-compressed LSTM neural network for fast facial expression recognition (FER) in videos on mobile devices. Firstly, a spatio-temporal FER LSTM model is built by extracting time-series feature maps from facial clips. The LSTM model is further deeply compressed with tensorization. Based on dataset of Acted Facial Expression in Wild (AFEW) 7.0, experimental results show that the proposed method achieves 55.60% classification accuracy; and significantly compresses the size of network model by 219x. Our work is further implemented on RK3399Pro IoT device with Neural Process Engine, and the runtime of feature extraction part can be reduced by 12.83x with only 7.73W power consumption.
What problem does this paper attempt to address?