Facial Action Recognition Using Very Deep Networks for Highly Imbalanced Class Distribution

Wan Ding, Dong-Yan Huang, Zhuo Chen, Xinguo Yu, Weisi Lin
DOI: https://doi.org/10.1109/apsipa.2017.8282246
2017-01-01
Abstract: In natural conditions, positive samples of facial actions are far fewer than negative samples. Such highly imbalanced class distributions can cause the error to converge very slowly when neural networks are used for facial action recognition. Traditional methods tackle the class-imbalance problem by changing the data distribution, which makes it hard to avoid losing useful information. In this paper we tackle the problem by using very deep (>10 layers) architectures to increase the chance that network training converges at an acceptable rate on highly imbalanced data sets. Experimental results on the EmotioNet Challenge data set show that the error rates of very deep convolutional networks converge to 40% after 90 epochs, while those of shallower networks converge only to 60%. The results also show that the very deep network outperforms the shallower network by 0.2 in accuracy score. The proposed neural networks won first place in the first track, automatic detection of action units (AUs), of the EmotioNet Challenge.
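To illustrate why changing the data distribution risks losing information, here is a minimal sketch of random undersampling, one common rebalancing technique. It is not the paper's method (the paper avoids resampling), and the function name `undersample` and the 5% positive rate are assumptions chosen only for illustration.

```python
import random


def undersample(labels, seed=0):
    """Randomly drop negatives until both classes have equal size.

    Returns the sorted indices of the samples that are kept.
    Hypothetical helper for illustration; not taken from the paper.
    """
    rng = random.Random(seed)
    pos = [i for i, y in enumerate(labels) if y == 1]
    neg = [i for i, y in enumerate(labels) if y == 0]
    # Keep as many negatives as there are positives, chosen at random.
    kept_neg = rng.sample(neg, min(len(pos), len(neg)))
    return sorted(pos + kept_neg)


# Assumed toy data: 50 positives and 950 negatives (5% positive rate).
labels = [1] * 50 + [0] * 950
kept = undersample(labels)
discarded = len(labels) - len(kept)
print(f"kept {len(kept)} samples, discarded {discarded} negatives")
```

In this toy setting, balancing the classes discards 900 of the 950 negative samples, which is the kind of information loss that motivates training on the imbalanced data directly.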