Reindex-Then-Adapt: Improving Large Language Models for Conversational Recommendation

Zhankui He, Zhouhang Xie, Harald Steck, Dawen Liang, Rahul Jha, Nathan Kallus, Julian McAuley
2024-05-20
Abstract: Large language models (LLMs) are revolutionizing conversational recommender systems by adeptly indexing item content, understanding complex conversational contexts, and generating relevant item titles. However, controlling the distribution of recommended items remains a challenge. This leads to suboptimal performance due to the failure to capture rapidly changing data distributions, such as item popularity, on targeted conversational recommendation platforms. In conversational recommendation, LLMs recommend items by generating the titles (as multiple tokens) autoregressively, making it difficult to obtain and control the recommendations over all items. Thus, we propose a Reindex-Then-Adapt (RTA) framework, which converts multi-token item titles into single tokens within LLMs, and then adjusts the probability distributions over these single-token item titles accordingly. The RTA framework marries the benefits of both LLMs and traditional recommender systems (RecSys): it understands complex queries as LLMs do, while efficiently controlling the recommended item distributions in conversational recommendations as traditional RecSys do. Our framework demonstrates improved accuracy metrics across three different conversational recommendation datasets and two adaptation settings.
Information Retrieval, Artificial Intelligence, Computation and Language
What problem does this paper attempt to address?
The paper addresses the difficulty of controlling the distribution of recommended items when large language models (LLMs) generate recommendations in conversational recommender systems. Specifically:

1. **Mismatch in Recommended Item Distribution**: Although LLMs excel at understanding complex conversational contexts and generating relevant item titles, they fail to capture rapidly changing data distributions (such as item popularity) on a target platform. This leads to suboptimal recommendations.
2. **Challenges in Generating Multi-Token Titles**: In conversational recommendation, LLMs generate item titles autoregressively as sequences of multiple tokens. This makes it difficult to obtain a probability distribution over all items, and therefore difficult to adjust it.

To address these issues, the paper proposes the Reindex-Then-Adapt (RTA) framework, which converts multi-token item titles into single tokens within the LLM and then adjusts the probability distribution over these single-token items. RTA combines the query understanding of LLMs with the controllability of traditional recommender systems, and the authors report improved accuracy metrics across three conversational recommendation datasets and two adaptation settings.
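The two steps described above can be illustrated with a toy sketch. This is not the authors' implementation: the embeddings, the mean-pooling aggregator, the additive bias, and every name below (`TOKEN_EMB`, `reindex`, `adapt`, `platform_bias`) are assumptions chosen only to show the idea. "Reindex" collapses each multi-token title into a single item vector so that one scoring pass yields a distribution over all items; "adapt" then shifts that distribution toward a target platform.

```python
import math

# Hypothetical 2-d token embeddings; a real LLM would supply these.
TOKEN_EMB = {
    "The": [0.1, 0.0],
    "Matrix": [0.9, 0.2],
    "Toy": [0.0, 0.8],
    "Story": [0.2, 0.9],
    "Inception": [0.8, 0.1],
}

# Item titles as a tokenizer might split them (illustrative only).
ITEM_TOKENS = {
    "The Matrix": ["The", "Matrix"],
    "Toy Story": ["Toy", "Story"],
    "Inception": ["Inception"],
}


def reindex(item_tokens, token_emb):
    """Collapse each multi-token title into one vector.

    Mean pooling is a stand-in assumption, not the paper's aggregator.
    """
    item_emb = {}
    for title, tokens in item_tokens.items():
        vecs = [token_emb[t] for t in tokens]
        dim = len(vecs[0])
        item_emb[title] = [sum(v[d] for v in vecs) / len(vecs) for d in range(dim)]
    return item_emb


def score_items(context_vec, item_emb):
    """One dot product per item gives logits over the whole catalog."""
    return {
        title: sum(c * x for c, x in zip(context_vec, vec))
        for title, vec in item_emb.items()
    }


def softmax(logits):
    """Numerically stable softmax over a dict of logits."""
    m = max(logits.values())
    exps = {k: math.exp(v - m) for k, v in logits.items()}
    z = sum(exps.values())
    return {k: e / z for k, e in exps.items()}


def adapt(logits, bias, weight=1.0):
    """Shift item logits by a platform-specific bias (assumed additive form)."""
    return {t: l + weight * bias.get(t, 0.0) for t, l in logits.items()}


item_emb = reindex(ITEM_TOKENS, TOKEN_EMB)
context = [1.0, 0.0]  # hypothetical representation of the conversation
logits = score_items(context, item_emb)
base_probs = softmax(logits)

# Hypothetical signal, e.g. an item that is currently popular on the platform.
platform_bias = {"Toy Story": 1.0}
adapted_probs = softmax(adapt(logits, platform_bias))

print(max(base_probs, key=base_probs.get))
print(max(adapted_probs, key=adapted_probs.get))
```

Because every item is a single entry in the output distribution, the adjustment is a cheap operation on one vector of logits rather than a change to how multi-token titles are decoded.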