Ef-QuantFace: Streamlined Face Recognition with Small Data and Low-Bit Precision

William Gazali, Jocelyn Michelle Kho, Joshua Santoso, Williem
2024-02-28
Abstract: In recent years, model quantization for face recognition has gained prominence. Traditionally, compressing models involved vast datasets like the 5.8 million-image MS1M dataset as well as extensive training times, raising the question of whether such data enormity is essential. This paper addresses this by introducing an efficiency-driven approach, fine-tuning the model with just up to 14,000 images, 440 times smaller than MS1M. We demonstrate that effective quantization is achievable with a smaller dataset, presenting a new paradigm. Moreover, we incorporate an evaluation-based metric loss and achieve an outstanding 96.15% accuracy on the IJB-C dataset, establishing a new state-of-the-art compressed model training for face recognition. The subsequent analysis delves into potential applications, emphasizing the transformative power of this approach. This paper advances model quantization by highlighting the efficiency and optimal results with small data and training time.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper examines how to quantize face recognition models effectively using small datasets and low-bit precision. Traditional model compression methods usually rely on large-scale datasets, such as MS1M, and on long training times. The paper instead proposes an efficiency-driven method that fine-tunes the model with at most 14,000 images (440 times smaller than MS1M). By introducing an evaluation-based metric loss, the authors achieve 96.15% accuracy on the IJB-C dataset, establishing a new state of the art for compressed model training in face recognition. Unlike QuantFace, the method relies on small datasets (such as 14,000 images from the LFW dataset), reducing training time from 6,600 minutes to 15 minutes and reaching state-of-the-art performance in a single training epoch. The study also reviews other model compression techniques, such as model pruning, and demonstrates that accuracy can be recovered even when only a small amount of data is available. Experimental results show that the method performs well across various face recognition benchmarks, at a level comparable to models trained on large amounts of data. This suggests that large datasets are not essential for model quantization, and that efficient training and deployment are achievable, especially on resource-constrained devices. Future work may include further exploration of quantization challenges for larger models and the application of low-bit quantization to deep face recognition models.
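To make "low-bit precision" concrete, the following is a minimal sketch of symmetric uniform quantization, a common generic scheme. It is not taken from the paper and is not claimed to be the authors' quantizer; the function name `quantize_uniform`, the per-tensor max-abs scale, and the sample weights are all assumptions chosen for illustration.

```python
def quantize_uniform(values, bits):
    """Quantize floats to a signed symmetric grid, then map back to floats.

    Illustrative only: a generic per-tensor scheme, not the paper's method.
    Returns (dequantized_values, scale).
    """
    if bits < 2:
        raise ValueError("bits must be at least 2 for a signed symmetric grid")
    qmax = 2 ** (bits - 1) - 1            # 127 for 8 bits, 7 for 4 bits
    max_abs = max(abs(v) for v in values)
    if max_abs == 0:
        return list(values), 1.0          # nothing to scale
    scale = max_abs / qmax                # width of one quantization step
    # Round to the nearest integer code and clamp to [-qmax, qmax].
    codes = [max(-qmax, min(qmax, round(v / scale))) for v in values]
    return [c * scale for c in codes], scale


def max_abs_error(original, approx):
    """Largest absolute difference between two equal-length sequences."""
    return max(abs(a - b) for a, b in zip(original, approx))


# Hypothetical weights, used only to compare bit widths.
weights = [0.5, -1.0, 0.25, 0.0, 0.8]
for bits in (8, 6, 4):
    deq, scale = quantize_uniform(weights, bits)
    print(bits, "bits: step =", scale, "max error =", max_abs_error(weights, deq))
```

In this scheme the rounding error per value is bounded by half a step, and the step grows as the bit width shrinks, which is why low-bit models typically need some fine-tuning to recover accuracy.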