Using DSCB: A Depthwise Separable Convolution Block Rebuild MTCNN for Face Detection.
Qiang Wang,Jingru Cui,Zunying Qin,Ninggang An,Xiaofei Ma,Guodong Li
DOI: https://doi.org/10.1145/3512388.3512389
2022-01-01
Abstract:Nowadays, there are huge demands of face detection in images and videos for surveillance, education, autonomous driving and health care. These application scenarios need high accuracy and efficiency of face detection. However, in some scene, unconstrained pose variation, occlusion, large number of faces and illumination bring great challenges to existing face detection methods. In view of above problems, we propose a depthwise separable convolution block (DSCB) which can maintain the speed of training and improve the accuracy at the same time. Then, using the proposed DSCB, we design a face detection model based on MTCNN (Multi-task Convolution Neural Network) to improve performance of occlusion, unconstrained pose variation, large numbers of small targets. In order to better evaluate the proposed method, we built a new dataset which is derived from the classroom teaching scene for training and evaluating. Our dataset consists of 7168 images and 294924 face bounding boxes with occlusion, unconstrained pose variation, and large numbers of small targets. The comparative experiments on our dataset show that the proposed method is superior to other state-of-the-art methods in accuracy and speed of face detection. Compared with the original MTCNN, the face detection method we proposed can bring about 3.9%, 8.66% and 1.39 times overall performance improvement on precision, recall and detection speed respectively.