Efficient Low-Resolution Face Recognition via Bridge Distillation

Shiming Ge,Shengwei Zhao,Chenyu Li,Yu Zhang,Jia Li
2024-09-18
Abstract:Face recognition in the wild is now advancing towards light-weight models, fast inference speed and resolution-adapted capability. In this paper, we propose a bridge distillation approach to turn a complex face model pretrained on private high-resolution faces into a light-weight one for low-resolution face recognition. In our approach, such a cross-dataset resolution-adapted knowledge transfer problem is solved via two-step distillation. In the first step, we conduct cross-dataset distillation to transfer the prior knowledge from private high-resolution faces to public high-resolution faces and generate compact and discriminative features. In the second step, the resolution-adapted distillation is conducted to further transfer the prior knowledge to synthetic low-resolution faces via multi-task learning. By learning low-resolution face representations and mimicking the adapted high-resolution knowledge, a light-weight student model can be constructed with high efficiency and promising accuracy in recognizing low-resolution faces. Experimental results show that the student model performs impressively in recognizing low-resolution faces with only 0.21M parameters and 0.057MB memory. Meanwhile, its speed reaches up to 14,705, ~934 and 763 faces per second on GPU, CPU and mobile phone, respectively.
Computer Vision and Pattern Recognition,Artificial Intelligence,Multimedia
What problem does this paper attempt to address?
The paper attempts to address the problem of efficient face recognition in low-resolution scenarios. Specifically, most current high-accuracy face recognition models perform poorly on low-resolution images due to their complex architectures and large number of parameters, and their deployment on low-end devices is also uneconomical. To solve these issues, the authors propose a method called "bridge distillation." This method converts a complex model pre-trained on a private high-resolution face dataset into a lightweight model suitable for low-resolution face recognition tasks through a two-step distillation process. The specific steps are as follows: 1. **Cross-dataset distillation**: First, knowledge is extracted from the private high-resolution face dataset and adapted to a public high-resolution face dataset, generating compact and discriminative feature representations. 2. **Resolution-adaptive distillation**: Next, this knowledge is further transferred to synthetic low-resolution face images. Through multi-task learning, a lightweight student model is trained, which can maintain high recognition accuracy while significantly reducing computational resources and memory consumption. Through the above methods, researchers can construct a model with a minimal number of parameters, low memory usage, and very fast inference speed. Experiments have shown that this model performs excellently in low-resolution face recognition tasks.