LEARN: Knowledge Adaptation from Large Language Model to Recommendation for Practical Industrial Application

Jian Jia, Yipei Wang, Yan Li, Honggang Chen, Xuehan Bai, Zhaocheng Liu, Jian Liang, Quan Chen, Han Li, Peng Jiang, Kun Gai
2024-12-10
Abstract: Contemporary recommendation systems predominantly rely on ID embedding to capture latent associations among users and items. However, this approach overlooks the wealth of semantic information embedded within textual descriptions of items, leading to suboptimal performance and poor generalizations. Leveraging the capability of large language models to comprehend and reason about textual content presents a promising avenue for advancing recommendation systems. To achieve this, we propose an Llm-driven knowlEdge Adaptive RecommeNdation (LEARN) framework that synergizes open-world knowledge with collaborative knowledge. We address computational complexity concerns by utilizing pretrained LLMs as item encoders and freezing LLM parameters to avoid catastrophic forgetting and preserve open-world knowledge. To bridge the gap between the open-world and collaborative domains, we design a twin-tower structure supervised by the recommendation task and tailored for practical industrial application. Through experiments on the real large-scale industrial dataset and online A/B tests, we demonstrate the efficacy of our approach in industry application. We also achieve state-of-the-art performance on six Amazon Review datasets to verify the superiority of our method.
Information Retrieval, Artificial Intelligence
What problem does this paper attempt to address?
This paper addresses the fact that current recommender systems (RS) rely mainly on ID embeddings to model the latent associations between users and items, and ignore the rich semantic information in item text descriptions. This leads to suboptimal performance, especially in cold-start scenarios and for long-tail users. In addition, ID-embedding-based modeling has not produced a pre-trained model that performs well across downstream tasks and sub-scenarios, as has been possible in computer vision (CV) and natural language processing (NLP).

To address these issues, the authors propose a framework named LEARN (Llm-driven knowlEdge Adaptive RecommeNdation), which aims to bring the text-understanding and logical-reasoning capabilities of large language models (LLMs) into the recommender system and thereby improve its performance and generalization ability. Specifically, the LEARN framework works as follows:

1. **Utilizing the open-world knowledge of the LLM**: A pre-trained LLM serves as the item encoder and extracts a content embedding from each item's text description. The LLM parameters are frozen to avoid catastrophic forgetting and to retain open-world knowledge.
2. **Twin-tower structure design**: The model has a user tower and an item tower, each containing a content extraction (CEX) module and a preference alignment (PAL) module. The CEX module uses the pre-trained LLM to generate content embeddings, and the PAL module converts these content embeddings into the user and item embeddings required by the recommender system through a self-supervised contrastive learning mechanism.
3. **Bridging the open-world and collaborative domains**: The PAL module is trained with the self-supervised objective of the recommendation task, which guides model optimization and narrows the gap between the open-world domain and the collaborative domain.
4. **Efficient computation**: To keep the cost of processing large-scale user interaction histories manageable, the CEX module applies the LLM only to item text descriptions rather than to user-preference encoding, and the LLM parameters stay frozen during training.

Through these methods, the LEARN framework integrates the open-world knowledge of the LLM into the recommender system, improving its performance and generalization ability, especially in industrial applications. According to the abstract, LEARN is validated on a real large-scale industrial dataset and in online A/B tests, and achieves state-of-the-art performance on six Amazon Review datasets.

In summary, the main contribution of this paper is a new framework that applies LLM knowledge to recommender systems, addressing the insufficient use of semantic information and the poor generalization ability of existing methods.
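The CEX/PAL tower design summarized above can be illustrated with a minimal NumPy sketch. It is not the authors' implementation, and everything in it is an assumption made for illustration: random vectors stand in for frozen LLM content embeddings, the PAL module is reduced to mean pooling plus a linear projection, and the contrastive objective is a plain in-batch InfoNCE loss. The names `user_tower`, `item_tower`, and `info_nce` are hypothetical.

```python
import numpy as np


def l2_normalize(x, eps=1e-12):
    """Scale each row to unit length so dot products are cosine similarities."""
    return x / (np.linalg.norm(x, axis=-1, keepdims=True) + eps)


def user_tower(history_content, w_user):
    """Assumed stand-in for the user-side PAL module.

    history_content: (batch, hist_len, content_dim) frozen content embeddings
    of the items each user interacted with. Mean pooling plus a linear
    projection replaces whatever sequence model the paper uses.
    """
    pooled = history_content.mean(axis=1)
    return l2_normalize(pooled @ w_user)


def item_tower(item_content, w_item):
    """Assumed stand-in for the item-side PAL module: a linear projection."""
    return l2_normalize(item_content @ w_item)


def info_nce(user_vecs, item_vecs, temperature=0.07):
    """In-batch contrastive loss: row i's positive is column i, the rest are negatives."""
    logits = user_vecs @ item_vecs.T / temperature
    logits = logits - logits.max(axis=1, keepdims=True)  # numerical stability
    log_probs = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return float(-np.mean(np.diag(log_probs)))


rng = np.random.default_rng(0)
num_items, content_dim, embed_dim = 50, 16, 8
batch_size, hist_len = 4, 5

# Stand-in for CEX output: computed once by a frozen LLM, never updated.
item_content = rng.standard_normal((num_items, content_dim))

# Only the projection weights would be trainable in this sketch.
w_user = rng.standard_normal((content_dim, embed_dim)) / np.sqrt(content_dim)
w_item = rng.standard_normal((content_dim, embed_dim)) / np.sqrt(content_dim)

# Synthetic interactions: each user has a history and one target item.
histories = rng.integers(0, num_items, size=(batch_size, hist_len))
targets = rng.choice(num_items, size=batch_size, replace=False)

user_vecs = user_tower(item_content[histories], w_user)
item_vecs = item_tower(item_content[targets], w_item)
loss = info_nce(user_vecs, item_vecs)
print(f"contrastive loss on random data: {loss:.4f}")
```

Because the content embeddings are fixed inputs, they could be precomputed offline and cached, which is one way to read the summary's point about only running the LLM over item text. A real system would train `w_user` and `w_item` (or richer modules) by gradient descent on this loss.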