Reinforce Tokens for the Next Recommendation Generation

Kai Zhang, Linping Gao, Zhaobao Yang, Biao Zhang, Xiaofen Xing
DOI: https://doi.org/10.1007/978-981-97-5618-6_21
2024-01-01
Abstract: Generative sequential recommendations based on Large Language Models (LLMs) have garnered significant attention due to the remarkable generalization capabilities of LLMs. However, the conventional ID-based representation not only widens the gap between language modeling and preference modeling but also leaves rich textual side information untouched. While textual representation can harness linguistic capabilities, it falls short in capturing the potential connections between users and items. In this paper, we introduce a method, Reinforce Tokens for the next Recommendation generation (RT-Rec), aimed at empowering text-based tokens with high-order relation representation abilities. We frame the sequential recommendation task as next-word generation and harness LLMs to accomplish it. Specifically, we employ both LLMs and graph-based collaborative filtering to model user preferences and generate semantically rich embeddings. Subsequently, we incorporate graph convolution within vector quantization and train diverse collaborative filters by distilling the corresponding graph layer knowledge. Finally, we adhere to conditional language tuning on LLaMA2 without dissecting its internal modules. By doing so, we convert the recommendation task into a conventional natural language task of generating the next token. Empirical evaluations demonstrate the effectiveness of RT-Rec in both capturing potential interactions and handling sequential recommendation tasks.
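The abstract does not give implementation details, so the following is only a rough illustration of the general idea of combining graph convolution with vector quantization to obtain discrete item tokens. Everything in it is an assumption made for illustration: the function names `propagate` and `quantize`, the symmetric degree normalization, the toy interaction matrix, the random codebook, and the `<item_code_k>` token format are hypothetical and are not taken from RT-Rec.

```python
import numpy as np


def propagate(interactions, user_emb, item_emb):
    """One degree-normalized graph convolution step on a user-item bipartite graph.

    interactions: (num_users, num_items) binary matrix, 1 where a user interacted.
    Returns updated (user, item) embeddings aggregated from graph neighbours.
    """
    deg_u = interactions.sum(axis=1, keepdims=True)  # (num_users, 1)
    deg_i = interactions.sum(axis=0, keepdims=True)  # (1, num_items)
    # Symmetric normalization; degrees are clipped to 1 to avoid division by zero.
    norm = interactions / np.sqrt(np.maximum(deg_u, 1) * np.maximum(deg_i, 1))
    new_user = norm @ item_emb      # users aggregate their items
    new_item = norm.T @ user_emb    # items aggregate their users
    return new_user, new_item


def quantize(vectors, codebook):
    """Assign each vector the index of its nearest codebook entry (squared L2)."""
    dists = ((vectors[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=-1)
    return dists.argmin(axis=1)


# Toy data (hypothetical): 3 users, 4 items, 2-dimensional embeddings.
rng = np.random.default_rng(0)
interactions = np.array(
    [[1, 1, 0, 0],
     [0, 1, 1, 0],
     [0, 0, 1, 1]],
    dtype=float,
)
user_emb = rng.normal(size=(3, 2))
item_emb = rng.normal(size=(4, 2))
codebook = rng.normal(size=(8, 2))  # 8 discrete codes, randomly initialized

_, item_graph_emb = propagate(interactions, user_emb, item_emb)
codes = quantize(item_graph_emb, codebook)

# Each item becomes a discrete token that could be added to an LLM vocabulary.
item_tokens = [f"<item_code_{c}>" for c in codes]
print(item_tokens)
```

In a sketch like this, items with similar graph neighbourhoods tend to fall into the same code, which is one way discrete tokens could carry collaborative signal into a next-token generation setup. A trained codebook and multiple graph layers would replace the random codebook and single propagation step used here.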