End-to-End Learnable Item Tokenization for Generative Recommendation

Enze Liu,Bowen Zheng,Cheng Ling,Lantao Hu,Han Li,Wayne Xin Zhao

DOI: https://doi.org/10.48550/arXiv.2409.05546

2024-09-09

Abstract:Recently, generative recommendation has emerged as a promising new paradigm that directly generates item identifiers for recommendation. However, a key challenge lies in how to effectively construct item identifiers that are suitable for recommender systems. Existing methods typically decouple item tokenization from subsequent generative recommendation training, likely resulting in suboptimal performance. To address this limitation, we propose ETEGRec, a novel End-To-End Generative Recommender by seamlessly integrating item tokenization and generative recommendation. Our framework is developed based on the dual encoder-decoder architecture, which consists of an item tokenizer and a generative recommender. In order to achieve mutual enhancement between the two components, we propose a recommendation-oriented alignment approach by devising two specific optimization objectives: sequence-item alignment and preference-semantic alignment. These two alignment objectives can effectively couple the learning of item tokenizer and generative recommender, thereby fostering the mutual enhancement between the two components. Finally, we further devise an alternating optimization method, to facilitate stable and effective end-to-end learning of the entire framework. Extensive experiments demonstrate the effectiveness of our proposed framework compared to a series of traditional sequential recommendation models and generative recommendation baselines.

Information Retrieval

What problem does this paper attempt to address?

The problem this paper attempts to address is the effective construction of item identifiers in generative recommendation systems. Specifically, existing generative recommendation methods typically handle item tokenization separately from the subsequent generative recommendation training process, which may lead to suboptimal performance. To overcome this limitation, the paper proposes a new framework called ETEGRec, which achieves end-to-end learning by seamlessly integrating item tokenization and generative recommendation. ETEGRec is based on a dual encoder-decoder architecture, including an item tokenizer and a generative recommender. To facilitate mutual enhancement between the two components, the paper proposes two specific optimization objectives: sequence-item alignment and preference-semantic alignment. Additionally, the paper designs an alternating optimization method to ensure stable and effective end-to-end learning of the entire framework. Experimental results show that ETEGRec outperforms traditional sequential recommendation models and generative recommendation baseline models on multiple recommendation benchmark datasets.

End-to-End Learnable Item Tokenization for Generative Recommendation

Learnable Item Tokenization for Generative Recommendation

EAGER: Two-Stream Generative Recommender with Behavior-Semantic Collaboration

TokenRec: Learning to Tokenize ID for LLM-based Generative Recommendation

Generative Retrieval with Semantic Tree-Structured Item Identifiers via Contrastive Learning

IDGenRec: LLM-RecSys Alignment with Textual ID Learning

Generative Explore-Exploit: Training-free Optimization of Generative Recommender Systems using LLM Optimizers

Learning to Tokenize for Generative Retrieval

A General Tail Item Representation Enhancement Framework for Sequential Recommendation

Content-Based Collaborative Generation for Recommender Systems

GenRec: Generative Sequential Recommendation with Large Language Models

Unifying Generative and Dense Retrieval for Sequential Recommendation

GPT4Rec: A Generative Framework for Personalized Recommendation and User Interests Interpretation

Generative Session-based Recommendation

Generative Recommendation: Towards Next-generation Recommender Paradigm

Generate and Instantiate What You Prefer: Text-Guided Diffusion for Sequential Recommendation

LkeRec: Toward Lightweight End-to-End Joint Representation Learning for Building Accurate and Effective Recommendation

RecGPT: Generative Personalized Prompts for Sequential Recommendation via ChatGPT Training Paradigm

Continuous-Time Sequential Recommendation with Temporal Graph Collaborative Transformer

CoST: Contrastive Quantization Based Semantic Tokenization for Generative Recommendation

Bridging Items and Language: A Transition Paradigm for Large Language Model-Based Recommendation