End-to-end Training for Recommendation with Language-based User Profiles

Zhaolin Gao, Joyce Zhou, Yijia Dai, Thorsten Joachims
2024-10-24
Abstract: Many online platforms maintain user profiles for personalization. Unfortunately, these profiles are typically not interpretable or easily modifiable by the user. To remedy this shortcoming, we explore natural language-based user profiles, as they promise enhanced transparency and scrutability of recommender systems. While existing work has shown that language-based profiles from standard LLMs can be effective, such generalist LLMs are unlikely to be optimal for this task. In this paper, we introduce LangPTune, the first end-to-end learning method for training LLMs to produce language-based user profiles that optimize recommendation effectiveness. Through comprehensive evaluations of LangPTune across various training configurations and benchmarks, we demonstrate that our approach significantly outperforms existing profile-based methods. In addition, it approaches performance levels comparable to state-of-the-art, less transparent recommender systems, providing a robust and interpretable alternative to conventional systems. Finally, we validate the relative interpretability of these language-based user profiles through user studies involving crowdworkers and GPT-4-based evaluations. Implementation of LangPTune can be found at https://github.com/ZhaolinGao/LangPTune.
Information Retrieval, Machine Learning
What problem does this paper attempt to address?
This paper aims to make recommender systems more transparent and interpretable while preserving or improving recommendation quality. Traditional recommender systems typically represent users and items as high-dimensional vectors. These vectors are computationally efficient but hard to interpret, so users can rarely understand, modify, or steer the resulting recommendations. To address this, the authors explore natural-language user profiles, which describe a user's preferences in plain text.

### Main contributions of the paper:

1. **The LangPTune framework**: Presented as the first end-to-end training method for LLMs that generate natural-language user profiles optimized for recommendation effectiveness. LangPTune uses Reinforcement Learning for System Optimization (RLSO) to optimize the profile encoder and Contrastive Learning (CL) to optimize the recommender decoder.
2. **Theoretical derivation and explanation**: The paper provides theoretical derivations and intuitive explanations of how RLSO and CL optimize the model.
3. **Experimental verification**: The authors implement the framework with state-of-the-art LLMs and ranking models and evaluate it on multiple public datasets. LangPTune significantly outperforms other interpretable profile-based methods and approaches the performance of state-of-the-art, less transparent recommender systems.
4. **User study**: Studies with crowdworkers and GPT-4-based evaluations validate the relative interpretability of the generated natural-language user profiles.

### Key technical details:

- **Recommender Decoder**: Takes a natural-language profile and produces a ranked recommendation list by computing a similarity score between the profile and each item's metadata.
- **Profile Encoder**: Converts the user's interaction history into a natural-language profile. It is optimized with RLSO so that the generated profile maximizes the performance of the recommender system.
- **Joint optimization**: The profile encoder and recommender decoder are optimized alternately so that the two components improve overall recommendation quality together.

### Experimental results:

- **Performance improvement**: On the Amazon-Movie-TV and Amazon-Books datasets, LangPTune significantly outperforms the profile-based baselines on multiple evaluation metrics, including NDCG, MRR, and Recall.
- **Interpretability**: User studies with crowdworkers and GPT-4-based evaluations validate the relative interpretability of the generated natural-language user profiles.

In conclusion, by introducing natural-language user profiles, this paper improves the transparency and interpretability of recommender systems while keeping recommendation quality close to that of state-of-the-art, less transparent systems, offering a new direction for research on and application of recommender systems.
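
To make the Recommender Decoder's ranking step and the NDCG, MRR, and Recall metrics mentioned above more concrete, here is a minimal sketch. It is not taken from the LangPTune repository. It assumes that the profile text and the item metadata have already been mapped to embedding vectors (the toy vectors and item names below are made up), that cosine similarity serves as the similarity score, and that relevance is binary. All function names are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def rank_items(profile_vec, item_vecs):
    """Return item ids sorted by descending similarity to the profile."""
    scores = {item: cosine(profile_vec, vec) for item, vec in item_vecs.items()}
    return sorted(scores, key=scores.get, reverse=True)


def recall_at_k(ranked, relevant, k):
    """Fraction of relevant items that appear in the top k."""
    hits = sum(1 for item in ranked[:k] if item in relevant)
    return hits / len(relevant)


def mrr(ranked, relevant):
    """Reciprocal rank of the first relevant item (0.0 if none is found)."""
    for pos, item in enumerate(ranked, start=1):
        if item in relevant:
            return 1.0 / pos
    return 0.0


def ndcg_at_k(ranked, relevant, k):
    """NDCG@k with binary relevance and a log2 position discount."""
    dcg = sum(
        1.0 / math.log2(pos + 1)
        for pos, item in enumerate(ranked[:k], start=1)
        if item in relevant
    )
    ideal = sum(
        1.0 / math.log2(pos + 1)
        for pos in range(1, min(len(relevant), k) + 1)
    )
    return dcg / ideal if ideal > 0 else 0.0


# Toy, made-up embeddings: one user profile and three candidate items.
profile_vec = [1.0, 0.0]
item_vecs = {
    "space_opera": [0.9, 0.1],
    "romcom": [0.0, 1.0],
    "sci_fi_thriller": [0.6, 0.4],
}
relevant = {"sci_fi_thriller"}  # held-out item the user actually interacted with

ranked = rank_items(profile_vec, item_vecs)
print(ranked)
print("Recall@2:", recall_at_k(ranked, relevant, 2))
print("MRR:", mrr(ranked, relevant))
print("NDCG@2:", round(ndcg_at_k(ranked, relevant, 2), 4))
```

In this toy case the held-out item lands in second place, so Recall@2 is 1.0 and MRR is 0.5, while NDCG@2 is discounted for the item not being ranked first. In a real system the embeddings would come from a learned text encoder rather than hand-written vectors.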