EfficientSRFace: An Efficient Network with Super-Resolution Enhancement for Accurate Face Detection

Guangtao Wang,Jun Li,Jie Xie,Jianhua Xu,Bo Yang
2023-06-04
Abstract:In face detection, low-resolution faces, such as numerous small faces of a human group in a crowded scene, are common in dense face prediction tasks. They usually contain limited visual clues and make small faces less distinguishable from the other small objects, which poses great challenge to accurate face detection. Although deep convolutional neural network has significantly promoted the research on face detection recently, current deep face detectors rarely take into account low-resolution faces and are still vulnerable to the real-world scenarios where massive amount of low-resolution faces exist. Consequently, they usually achieve degraded performance for low-resolution face detection. In order to alleviate this problem, we develop an efficient detector termed EfficientSRFace by introducing a feature-level super-resolution reconstruction network for enhancing the feature representation capability of the model. This module plays an auxiliary role in the training process, and can be removed during the inference without increasing the inference time. Extensive experiments on public benchmarking datasets, such as FDDB and WIDER Face, show that the embedded image super-resolution module can significantly improve the detection accuracy at the cost of a small amount of additional parameters and computational overhead, while helping our model achieve competitive performance compared with the state-of-the-arts methods.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
### The Problem the Paper Attempts to Solve This paper aims to address the challenges in low-resolution face detection. In dense face prediction tasks, such as crowds in crowded scenes, low-resolution faces are very common. These small faces usually contain limited visual cues, making them difficult to distinguish from other small objects, thus posing a significant challenge to accurate face detection. Although deep convolutional neural networks (CNNs) have made significant progress in face detection, existing deep face detectors rarely consider low-resolution faces and perform poorly when dealing with a large number of low-resolution faces in real-world scenarios. To alleviate this problem, the authors propose an efficient detector called EfficientSRFace, which enhances the model's feature representation capability by introducing a feature-level super-resolution reconstruction network. This module serves as an auxiliary during training but can be removed during the inference stage, without increasing inference time. Experimental results show that the embedded image super-resolution module can significantly improve detection accuracy with a small amount of additional parameters and computational overhead, making the model competitive with existing state-of-the-art methods in terms of performance.