Abstract: Many online platforms maintain user profiles for personalization. Unfortunately, these profiles are typically not interpretable or easily modifiable by the user. To remedy this shortcoming, we explore natural language-based user profiles, as they promise enhanced transparency and scrutability of recommender systems. While existing work has shown that language-based profiles from standard LLMs can be effective, such generalist LLMs are unlikely to be optimal for this task. In this paper, we introduce LangPTune, the first end-to-end learning method for training LLMs to produce language-based user profiles that optimize recommendation effectiveness. Through comprehensive evaluations of LangPTune across various training configurations and benchmarks, we demonstrate that our approach significantly outperforms existing profile-based methods. In addition, it approaches performance levels comparable to state-of-the-art, less transparent recommender systems, providing a robust and interpretable alternative to conventional systems. Finally, we validate the relative interpretability of these language-based user profiles through user studies involving crowdworkers and GPT-4-based evaluations. Implementation of LangPTune can be found at https://github.com/ZhaolinGao/LangPTune.

What problem does this paper attempt to address?

This paper aims to make recommender systems more transparent and interpretable while maintaining or improving recommendation effectiveness. Traditional recommender systems usually represent users and items as high-dimensional vectors. These vectors are computationally efficient, but they are hard to interpret, so users cannot easily understand, modify, or steer the resulting recommendations. To address this, the authors explore natural-language-based user profiles, which describe a user's preferences in plain text and thereby make the system more transparent and interpretable.

### Main contributions of the paper:

1. **Proposing the LangPTune framework**: This is the first end-to-end training method for generating natural-language user profiles that optimize recommendation effectiveness. LangPTune combines Reinforcement Learning for System Optimization (RLSO) and Contrastive Learning (CL) to optimize the profile encoder and the recommender decoder, respectively.
2. **Theoretical derivation and explanation**: The paper provides detailed theoretical derivations and intuitive explanations of how the model is optimized through RLSO and CL.
3. **Experimental verification**: The authors implement the framework using state-of-the-art LLMs and ranking models and run experiments on multiple public datasets, showing that LangPTune significantly outperforms other interpretable profile-based methods and that its performance is close to that of traditional recommender systems.
4. **User study**: The relative interpretability of the generated natural-language user profiles is verified through crowdworker studies and GPT-4-based evaluation.

### Key technical details:

- **Recommender Decoder**: Takes a natural-language profile as input and produces a recommendation list by computing a similarity score between the profile and each item's metadata, then ranking items by that score.
- **Profile Encoder**: Converts the user's interaction history into a natural-language profile. It is optimized with RLSO so that the generated profile maximizes the performance of the recommender system.
- **Joint optimization**: The profile encoder and the recommender decoder are optimized alternately so that the two components work together to improve overall recommendation quality.

### Experimental results:

- **Performance improvement**: On the Amazon-Movie-TV and Amazon-Books datasets, LangPTune significantly outperforms the baseline methods on multiple evaluation metrics, including NDCG, MRR, and Recall.
- **Interpretability**: User studies and GPT-4-based evaluation confirm that the generated natural-language user profiles are highly interpretable.

In conclusion, by introducing natural-language-based user profiles, this paper improves the transparency and interpretability of recommender systems while achieving recommendation performance close to that of conventional, less transparent systems, and it opens a new direction for the research and application of recommender systems.
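To make the ranking step and the evaluation metrics mentioned above more concrete, here is a minimal sketch in plain Python. It is not the LangPTune implementation: the toy embedding vectors, the use of cosine similarity as the "similarity score", the binary-relevance form of NDCG, and all function names (`rank_items`, `recall_at_k`, `mrr`, `ndcg_at_k`) are assumptions made for illustration. In a real system, the profile text and item metadata would first be embedded by a trained model.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def rank_items(profile_vec, item_vecs):
    """Return item ids sorted by decreasing similarity to the profile vector."""
    scores = {item: cosine(profile_vec, vec) for item, vec in item_vecs.items()}
    return sorted(scores, key=scores.get, reverse=True)

def recall_at_k(ranked, relevant, k):
    """Fraction of relevant items that appear in the top k."""
    hits = sum(1 for item in ranked[:k] if item in relevant)
    return hits / len(relevant) if relevant else 0.0

def mrr(ranked, relevant):
    """Reciprocal rank of the first relevant item (0.0 if none is ranked)."""
    for pos, item in enumerate(ranked, start=1):
        if item in relevant:
            return 1.0 / pos
    return 0.0

def ndcg_at_k(ranked, relevant, k):
    """NDCG@k with binary relevance (assumption: each relevant item has gain 1)."""
    dcg = sum(
        1.0 / math.log2(pos + 1)
        for pos, item in enumerate(ranked[:k], start=1)
        if item in relevant
    )
    ideal = sum(1.0 / math.log2(pos + 1) for pos in range(1, min(len(relevant), k) + 1))
    return dcg / ideal if ideal else 0.0

# Hypothetical toy embeddings standing in for an embedded profile and item metadata.
profile_vec = [1.0, 0.0]
item_vecs = {"A": [1.0, 0.0], "B": [0.0, 1.0], "C": [1.0, 1.0]}
relevant = {"C"}  # held-out item the user actually interacted with

ranked = rank_items(profile_vec, item_vecs)
print(ranked)
print(recall_at_k(ranked, relevant, 2), mrr(ranked, relevant), ndcg_at_k(ranked, relevant, 2))
```

In this toy setup, the profile vector is most similar to item `A`, then `C`, then `B`, so the single relevant item `C` lands at rank 2. All three metrics reward placing relevant items higher in the list, which is the signal an end-to-end method can use to judge how useful a generated profile is.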

End-to-end Training for Recommendation with Language-based User Profiles
