Aligning Language Models for Versatile Text-based Item Retrieval

Yuxuan Lei, Jianxun Lian, Jing Yao, Mingqi Wu, Defu Lian, Xing Xie
2024-02-29
Abstract: This paper addresses the gap between general-purpose text embeddings and the specific demands of item retrieval tasks. We demonstrate the shortcomings of existing models in capturing the nuances necessary for zero-shot performance on item retrieval tasks. To overcome these limitations, we propose generating an in-domain dataset from ten tasks tailored to unlocking models' representation ability for item retrieval. Our empirical studies demonstrate that fine-tuning embedding models on the dataset leads to remarkable improvements in a variety of retrieval tasks. We also illustrate the practical application of our refined model in a conversational setting, where it enhances the capabilities of LLM-based Recommender Agents like Chat-Rec. Our code is available at
Information Retrieval
What problem does this paper attempt to address?
This paper addresses the poor performance of general-purpose text embedding models on item retrieval. Existing models fail to capture the nuances that item retrieval requires, especially in zero-shot scenarios. To overcome this limitation, the authors build a specialized in-domain fine-tuning dataset drawn from ten tasks. Experiments show that embedding models fine-tuned on this dataset improve substantially across a variety of retrieval tasks, and that the refined model also enhances large language model-based recommender agents (such as Chat-Rec) in conversational settings.
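To make the setting concrete, the following is a minimal sketch of embedding-based item retrieval: items and a query are represented as vectors, and items are ranked by cosine similarity to the query. It is an illustration only, not the authors' code. The item names, the three-dimensional vectors, and the `cosine` and `retrieve` helpers are all hypothetical; in practice the vectors would come from an embedding model, such as one fine-tuned as described above.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0  # treat a zero vector as similar to nothing
    return dot / (norm_u * norm_v)


def retrieve(query_vec, item_vecs, k=2):
    """Return the k (item_name, score) pairs most similar to the query."""
    scored = [(name, cosine(query_vec, vec)) for name, vec in item_vecs.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


# Hypothetical toy embeddings; a real system would obtain these
# from an embedding model rather than writing them by hand.
ITEMS = {
    "space strategy game": [0.9, 0.1, 0.0],
    "farming simulator": [0.1, 0.9, 0.1],
    "sci-fi shooter": [0.7, 0.0, 0.6],
}
QUERY = [1.0, 0.0, 0.2]  # stand-in for an embedded text query

top = retrieve(QUERY, ITEMS, k=2)
for name, score in top:
    print(f"{name}: {score:.3f}")
```

Under this framing, fine-tuning changes only how the vectors are produced; the ranking step stays the same, which is why a better embedding model can be dropped into an existing retrieval pipeline or a conversational agent's retrieval tool.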