Experimental Analysis of Large-scale Learnable Vector Storage Compression

Hailin Zhang,Penghao Zhao,Xupeng Miao,Yingxia Shao,Zirui Liu,Tong Yang,Bin Cui
DOI: https://doi.org/10.14778/3636218.3636234
2024-01-01
Abstract:Learnable embedding vector is one of the most important applications inmachine learning, and is widely used in various database-related domains.However, the high dimensionality of sparse data in recommendation tasks and thehuge volume of corpus in retrieval-related tasks lead to a large memoryconsumption of the embedding table, which poses a great challenge to thetraining and deployment of models. Recent research has proposed various methodsto compress the embeddings at the cost of a slight decrease in model quality orthe introduction of other overheads. Nevertheless, the relative performance ofthese methods remains unclear. Existing experimental comparisons only cover asubset of these methods and focus on limited metrics. In this paper, we performa comprehensive comparative analysis and experimental evaluation of embeddingcompression. We introduce a new taxonomy that categorizes these techniquesbased on their characteristics and methodologies, and further develop a modularbenchmarking framework that integrates 14 representative methods. Under auniform test environment, our benchmark fairly evaluates each approach,presents their strengths and weaknesses under different memory budgets, andrecommends the best method based on the use case. In addition to providinguseful guidelines, our study also uncovers the limitations of current methodsand suggests potential directions for future research.
What problem does this paper attempt to address?