Abstract:Personalization, the ability to tailor a system to individual users, is an essential factor in user experience with natural language processing (NLP) systems. With the emergence of Large Language Models (LLMs), a key question is how to leverage these models to better personalize user experiences. To personalize a language model's output, a straightforward approach is to incorporate past user data into the language model prompt, but this approach can result in lengthy inputs exceeding limitations on input length and incurring latency and cost issues. Existing approaches tackle such challenges by selectively extracting relevant user data (i.e. selective retrieval) to construct a prompt for downstream tasks. However, retrieval-based methods are limited by potential information loss, lack of more profound user understanding, and cold-start challenges. To overcome these limitations, we propose a novel summary-augmented approach by extending retrieval-augmented personalization with task-aware user summaries generated by LLMs. The summaries can be generated and stored offline, enabling real-world systems with runtime constraints like voice assistants to leverage the power of LLMs. Experiments show our method with 75% less of retrieved user data is on-par or outperforms retrieval augmentation on most tasks in the LaMP personalization benchmark. We demonstrate that offline summarization via LLMs and runtime retrieval enables better performance for personalization on a range of tasks under practical constraints.

What problem does this paper attempt to address?

### What problems does this paper attempt to solve? This paper aims to address several key challenges of personalization in natural language processing (NLP) systems. Specifically, the author attempts to enhance the personalization ability by combining the summarization generation and retrieval techniques of large - language models (LLMs). The following are the main problems that this paper attempts to solve: 1. **Information Loss**: - Existing retrieval - based personalization methods may lead to information loss because they only selectively extract relevant user data while ignoring other potentially important information. 2. **Lack of In - depth Understanding**: - Existing methods have difficulty in achieving a deeper understanding of user data and cannot capture users' subtle preferences and complex behavior patterns. 3. **Cold - Start Problem**: - For new users, due to the lack of sufficient historical behavior data, existing methods often fail to provide effective personalized recommendations or responses. 4. **Input Length Limitation and Inference Cost**: - Incorporating a large amount of user data directly into the language model prompt will result in an overly long input, exceeding the model's input length limit and increasing the inference time and cost. To solve these problems, the author proposes a new method, that is, generating summaries of user data through large - language models and combining these summaries with real - time retrieved relevant user data. This method can provide more comprehensive and accurate user information without increasing the runtime delay, thereby enhancing the personalization effect. Specifically, the workflow of this method is as follows: - **Offline Summary Generation**: Use large - language models to generate summaries of user data, and these summaries can be generated in advance and stored. - **Real - Time Retrieval**: At runtime, retrieve the most relevant entries from the user data according to the task requirements. - **Constructing Prompts**: Combine the retrieved data, the offline - generated summaries, and the current task input to form a complete prompt for the generation task of the downstream language model. In this way, the author hopes to achieve better personalization effects in various NLP tasks, especially in cold - start scenarios, and can provide high - quality personalized output even when user data is sparse. ### Experimental Results Experiments show that this method achieves performance comparable to or better than existing methods on multiple NLP tasks. In particular, it can still maintain or even exceed the original performance even when reducing 75% of the retrieved user data. This proves the effectiveness and superiority of the offline summaries.

Integrating Summarization and Retrieval for Enhanced Personalization via Large Language Models

RLPF: Reinforcement Learning from Prediction Feedback for User Summarization with LLMs

Personalization of Large Language Models: A Survey

Retrieval-Augmented Personalization for Multimodal Large Language Models

LaMP: When Large Language Models Meet Personalization

Personalized Large Language Models

Knowledge-Augmented Large Language Models for Personalized Contextual Query Suggestion

Optimization Methods for Personalizing Large Language Models through Retrieval Augmentation

Personalized Recommendation Systems Powered By Large Language Models: Integrating Semantic Understanding and User Preferences

PersonalSum: A User-Subjective Guided Personalized Summarization Dataset for Large Language Models

On the Way to LLM Personalization: Learning to Remember User Conversations

Persona-DB: Efficient Large Language Model Personalization for Response Prediction with Collaborative Data Refinement

Tell me what I need to know: Exploring LLM-based (Personalized) Abstractive Multi-Source Meeting Summarization

PEARL: Personalizing Large Language Model Writing Assistants with Generation-Calibrated Retrievers

LLMs + Persona-Plug = Personalized LLMs

Prompting and Fine-Tuning of Small LLMs for Length-Controllable Telephone Call Summarization

When large language models meet personalization: perspectives of challenges and opportunities

Personalized Multimodal Large Language Models: A Survey

RealitySummary: Exploring On-Demand Mixed Reality Text Summarization and Question Answering using Large Language Models

Are Large Language Models In-Context Personalized Summarizers? Get an iCOPERNICUS Test Done!