Ef-QuantFace: Streamlined Face Recognition with Small Data and Low-Bit Precision

William Gazali,Jocelyn Michelle Kho,Joshua Santoso,Williem

2024-02-28

Abstract:In recent years, model quantization for face recognition has gained prominence. Traditionally, compressing models involved vast datasets like the 5.8 million-image MS1M dataset as well as extensive training times, raising the question of whether such data enormity is essential. This paper addresses this by introducing an efficiency-driven approach, fine-tuning the model with just up to 14,000 images, 440 times smaller than MS1M. We demonstrate that effective quantization is achievable with a smaller dataset, presenting a new paradigm. Moreover, we incorporate an evaluation-based metric loss and achieve an outstanding 96.15% accuracy on the IJB-C dataset, establishing a new state-of-the-art compressed model training for face recognition. The subsequent analysis delves into potential applications, emphasizing the transformative power of this approach. This paper advances model quantization by highlighting the efficiency and optimal results with small data and training time.

Computer Vision and Pattern Recognition

What problem does this paper attempt to address?

This paper mainly discusses how to effectively quantify face recognition models in small datasets and low bit precision. Traditional model compression methods usually rely on large-scale datasets, such as MS1M, and long-time training. However, this paper proposes an efficiency-driven method that only requires 14,000 images (440 times smaller than MS1M) to fine-tune the model. By introducing an evaluation-based metric loss function, they achieved an accuracy of 96.15% on the IJB-C dataset, establishing a new training state for compressed models. The paper introduces a method different from QuantFace, which focuses on using small datasets (such as 14,000 images from the LFW dataset), significantly reducing training time from 6,600 minutes to 15 minutes, and achieving state-of-the-art performance in a single training epoch. The study also analyzed other techniques in the field of model compression, such as model pruning, and demonstrated that accuracy can be recovered even with limited availability of small data. The experimental results show that this method achieves excellent performance in various face recognition benchmarks, even comparable to models using a large amount of data, even with a small amount of data. This suggests that large datasets are not essential for model quantization and efficient training and deployment can be achieved, especially for resource-constrained devices. Future work may include further exploration of quantization challenges for larger models and the application of low-bit quantization in deep facial recognition models.

Ef-QuantFace: Streamlined Face Recognition with Small Data and Low-Bit Precision

Compressed Models Decompress Race Biases: What Quantized Models Forget for Fair Face Recognition

Hessian-based Mixed-Precision Quantization with Transition Aware Training for Neural Networks

Pse: Mixed Quantization Framework of Neural Networks for Efficient Deployment

Tightly-Coupled Data Compression for Efficient Face Alignment

Rotation Consistent Margin Loss For Efficient Low-Bit Face Recognition

Effect of Lossy Compression Algorithms on Face Image Quality and Recognition

Resource-efficient Face Detector Using 1.5-Bit Frame-to-frame Delta Quantization for Image Based Always-on Wake-up Application.

SQuant: On-the-Fly Data-Free Quantization Via Diagonal Hessian Approximation

Towards Super Compressed Neural Networks for Object Identification: Quantized Low-Rank Tensor Decomposition with Self-Attention

WebFace260M: A Benchmark Unveiling the Power of Million-Scale Deep Face Recognition

SQuantizer: Simultaneous Learning for Both Sparse and Low-precision Neural Networks

Joint Rate-Distortion Optimization For Simultaneous Texture And Deep Feature Compression Of Facial Images

Efficient Low-Resolution Face Recognition via Bridge Distillation

VecQ: Minimal Loss DNN Model Compression With Vectorized Weight Quantization

A Deep Image Compression Framework for Face Recognition

Focused Quantization for Sparse CNNs

Compression for Text Detection and Recognition Based on Low Bit-Width Quantization

Privacy-Preserving SAM Quantization for Efficient Edge Intelligence in Healthcare

Deep Tiny Network for Recognition-Oriented Face Image Quality Assessment

Mitigate Compression Artifacts for Face in Video Recognition