AutoDPQ: Automated Differentiable Product Quantization for Embedding Compression

Xin Gan,Yuhao Wang,Xiangyu Zhao,Wanyu Wang,Yiqi Wang,Zitao Liu
DOI: https://doi.org/10.1145/3539618.3591953
2023-01-01
Abstract:Deep recommender systems typically involve numerous feature fields for users and items, with a large number of low-frequency features. These low-frequency features would reduce the prediction accuracy with large storage space due to their vast quantity and inadequate training. Some pioneering studies have explored embedding compression techniques to address this issue of the trade-off between storage space and model predictability. However, these methods have difficulty compacting the embedding of low-frequency features in various feature fields due to the high demand for human experience and computing resources during hyper-parameter searching. In this paper, we propose the AutoDPQ framework, which automatically compacts low-frequency feature embeddings for each feature field to an adaptive magnitude. Experimental results indicate that AutoDPQ can significantly reduce the parameter space while improving recommendation accuracy. Moreover, AutoDPQ is compatible with various deep CTR models by improving their performance significantly with high efficiency.
What problem does this paper attempt to address?