Integrating Summarization and Retrieval for Enhanced Personalization via Large Language Models

Chris Richardson,Yao Zhang,Kellen Gillespie,Sudipta Kar,Arshdeep Singh,Zeynab Raeesy,Omar Zia Khan,Abhinav Sethy
2023-10-31
Abstract:Personalization, the ability to tailor a system to individual users, is an essential factor in user experience with natural language processing (NLP) systems. With the emergence of Large Language Models (LLMs), a key question is how to leverage these models to better personalize user experiences. To personalize a language model's output, a straightforward approach is to incorporate past user data into the language model prompt, but this approach can result in lengthy inputs exceeding limitations on input length and incurring latency and cost issues. Existing approaches tackle such challenges by selectively extracting relevant user data (i.e. selective retrieval) to construct a prompt for downstream tasks. However, retrieval-based methods are limited by potential information loss, lack of more profound user understanding, and cold-start challenges. To overcome these limitations, we propose a novel summary-augmented approach by extending retrieval-augmented personalization with task-aware user summaries generated by LLMs. The summaries can be generated and stored offline, enabling real-world systems with runtime constraints like voice assistants to leverage the power of LLMs. Experiments show our method with 75% less of retrieved user data is on-par or outperforms retrieval augmentation on most tasks in the LaMP personalization benchmark. We demonstrate that offline summarization via LLMs and runtime retrieval enables better performance for personalization on a range of tasks under practical constraints.
Computation and Language,Artificial Intelligence,Information Retrieval
What problem does this paper attempt to address?
### What problems does this paper attempt to solve? This paper aims to address several key challenges of personalization in natural language processing (NLP) systems. Specifically, the author attempts to enhance the personalization ability by combining the summarization generation and retrieval techniques of large - language models (LLMs). The following are the main problems that this paper attempts to solve: 1. **Information Loss**: - Existing retrieval - based personalization methods may lead to information loss because they only selectively extract relevant user data while ignoring other potentially important information. 2. **Lack of In - depth Understanding**: - Existing methods have difficulty in achieving a deeper understanding of user data and cannot capture users' subtle preferences and complex behavior patterns. 3. **Cold - Start Problem**: - For new users, due to the lack of sufficient historical behavior data, existing methods often fail to provide effective personalized recommendations or responses. 4. **Input Length Limitation and Inference Cost**: - Incorporating a large amount of user data directly into the language model prompt will result in an overly long input, exceeding the model's input length limit and increasing the inference time and cost. To solve these problems, the author proposes a new method, that is, generating summaries of user data through large - language models and combining these summaries with real - time retrieved relevant user data. This method can provide more comprehensive and accurate user information without increasing the runtime delay, thereby enhancing the personalization effect. Specifically, the workflow of this method is as follows: - **Offline Summary Generation**: Use large - language models to generate summaries of user data, and these summaries can be generated in advance and stored. - **Real - Time Retrieval**: At runtime, retrieve the most relevant entries from the user data according to the task requirements. - **Constructing Prompts**: Combine the retrieved data, the offline - generated summaries, and the current task input to form a complete prompt for the generation task of the downstream language model. In this way, the author hopes to achieve better personalization effects in various NLP tasks, especially in cold - start scenarios, and can provide high - quality personalized output even when user data is sparse. ### Experimental Results Experiments show that this method achieves performance comparable to or better than existing methods on multiple NLP tasks. In particular, it can still maintain or even exceed the original performance even when reducing 75% of the retrieved user data. This proves the effectiveness and superiority of the offline summaries.