IDGenRec: LLM-RecSys Alignment with Textual ID Learning

Juntao Tan,Shuyuan Xu,Wenyue Hua,Yingqiang Ge,Zelong Li,Yongfeng Zhang

2024-05-17

Abstract:Generative recommendation based on Large Language Models (LLMs) have transformed the traditional ranking-based recommendation style into a text-to-text generation paradigm. However, in contrast to standard NLP tasks that inherently operate on human vocabulary, current research in generative recommendations struggles to effectively encode recommendation items within the text-to-text framework using concise yet meaningful ID representations. To better align LLMs with recommendation needs, we propose IDGen, representing each item as a unique, concise, semantically rich, platform-agnostic textual ID using human language tokens. This is achieved by training a textual ID generator alongside the LLM-based recommender, enabling seamless integration of personalized recommendations into natural language generation. Notably, as user history is expressed in natural language and decoupled from the original dataset, our approach suggests the potential for a foundational generative recommendation model. Experiments show that our framework consistently surpasses existing models in sequential recommendation under standard experimental setting. Then, we explore the possibility of training a foundation recommendation model with the proposed method on data collected from 19 different datasets and tested its recommendation performance on 6 unseen datasets across different platforms under a completely zero-shot setting. The results show that the zero-shot performance of the pre-trained foundation model is comparable to or even better than some traditional recommendation models based on supervised training, showing the potential of the IDGen paradigm serving as the foundation model for generative recommendation. Code and data are open-sourced at

Information Retrieval,Artificial Intelligence,Computation and Language,Machine Learning

What problem does this paper attempt to address?

The paper proposes a method to solve the item encoding problem in generative recommendation systems. The current generative recommendation systems are based on Language Models (LLMs), which convert the recommendation task into a text-to-text generation task. However, there is a problem: how to effectively represent the recommended items with concise and meaningful IDs. Due to this problem, the potential of LLMs in recommendation systems has not been fully explored. The paper introduces a new approach called IDGenRec, which trains a text ID generator to represent each item as a unique, concise, semantically rich, and platform-independent text ID using human language tokens. This allows user history to be expressed in natural language form and decoupled from the original dataset, making it possible to build the base generative recommendation model. Experimental results show that the IDGenRec framework outperforms existing models in sequential recommendation performance under standard experimental settings. After training on 19 different datasets, it also demonstrates zero-shot recommendation performance on 6 unseen datasets, comparable to or even better than some traditional supervised training recommendation models. In summary, the paper attempts to improve the utilization of LLMs in recommendation systems by creating text IDs with semantic information, thereby enhancing the performance of generative recommendation systems. It also achieves knowledge transfer across datasets and zero-shot recommendation capability.

IDGenRec: LLM-RecSys Alignment with Textual ID Learning

GenRec: Large Language Model for Generative Recommendation

Large Language Model for Generative Recommendation

Enhancing ID-based Recommendation with Large Language Models

Representation Learning with Large Language Models for Recommendation

E4SRec: An Elegant Effective Efficient Extensible Solution of Large Language Models for Sequential Recommendation

Reinforce Tokens for the Next Recommendation Generation

A First Look at LLM-Powered Generative News Recommendation

Adapting Large Language Models by Integrating Collaborative Semantics for Recommendation

GenRec: Generative Sequential Recommendation with Large Language Models

Generative Job Recommendations with Large Language Model

Large Language Model Can Interpret Latent Space of Sequential Recommender

Large Language Models for Recommendation: Progresses and Future Directions

Large Language Models for Recommendation: Past, Present, and Future

A Multi-facet Paradigm to Bridge Large Language Model and Recommendation

Large Language Models Are Zero-Shot Rankers for Recommender Systems

Personalized Recommendation Systems Powered By Large Language Models: Integrating Semantic Understanding and User Preferences

RA-Rec: An Efficient ID Representation Alignment Framework for LLM-based Recommendation

HLLM: Enhancing Sequential Recommendations via Hierarchical Large Language Models for Item and User Modeling

One Model for All: Large Language Models Are Domain-Agnostic Recommendation Systems

Collaborative Large Language Model for Recommender Systems