TALLRec: An Effective and Efficient Tuning Framework to Align Large Language Model with Recommendation

Keqin Bao,Jizhi Zhang,Yang Zhang,Wenjie Wang,Fuli Feng,Xiangnan He
DOI: https://doi.org/10.1145/3604915.3608857
2023-10-17
Abstract:Large Language Models (LLMs) have demonstrated remarkable performance across diverse domains, thereby prompting researchers to explore their potential for use in recommendation systems. Initial attempts have leveraged the exceptional capabilities of LLMs, such as rich knowledge and strong generalization through In-context Learning, which involves phrasing the recommendation task as prompts. Nevertheless, the performance of LLMs in recommendation tasks remains suboptimal due to a substantial disparity between the training tasks for LLMs and recommendation tasks, as well as inadequate recommendation data during pre-training. To bridge the gap, we consider building a Large Recommendation Language Model by tunning LLMs with recommendation data. To this end, we propose an efficient and effective Tuning framework for Aligning LLMs with Recommendation, namely TALLRec. We have demonstrated that the proposed TALLRec framework can significantly enhance the recommendation capabilities of LLMs in the movie and book domains, even with a limited dataset of fewer than 100 samples. Additionally, the proposed framework is highly efficient and can be executed on a single RTX 3090 with LLaMA-7B. Furthermore, the fine-tuned LLM exhibits robust cross-domain generalization. Our code and data are available at <a class="link-external link-https" href="https://github.com/SAI990323/TALLRec" rel="external noopener nofollow">this https URL</a>.
Information Retrieval
What problem does this paper attempt to address?
### Problems Addressed by the Paper The paper attempts to address the application issues of large language models (LLMs) in recommendation systems. Despite LLMs demonstrating exceptional performance in various fields, their performance in recommendation tasks is suboptimal due to the significant differences between training tasks and recommendation tasks, as well as the lack of sufficient recommendation data during the pre-training phase. To solve this problem, the authors propose an effective tuning framework—TALLRec, to align LLMs with recommendation tasks. ### Specific Problems and Solutions 1. **Mismatch between LLMs and Recommendation Tasks**: - **Problem**: The training tasks of LLMs mainly focus on language processing, while recommendation tasks require rich knowledge and strong generalization capabilities, leading to poor performance of LLMs in recommendation tasks. - **Solution**: By constructing a large recommendation language model (LRLM) and tuning LLMs with recommendation data to bridge this gap. 2. **Insufficient Recommendation Data**: - **Problem**: During the pre-training phase of LLMs, recommendation-related data is very limited, restricting their performance in recommendation tasks. - **Solution**: Using the TALLRec framework, tune LLMs with a small amount of recommendation data to better adapt them to recommendation tasks. 3. **Limitations of In-context Learning Alone**: - **Problem**: Relying solely on in-context learning methods performs poorly in recommendation tasks and sometimes fails to make recommendations. - **Solution**: Combine instruction tuning techniques, tuning LLMs with structured instructions and outputs to improve their performance in recommendation tasks. ### Experimental Validation To validate the effectiveness and efficiency of TALLRec, the authors conducted the following experiments: 1. **Performance Comparison**: - On movie and book recommendation datasets, TALLRec significantly outperformed traditional recommendation methods and LLM-based recommendation methods. - Even in few-shot training settings (e.g., 64 samples), TALLRec quickly enhanced the recommendation capabilities of LLMs. 2. **Ablation Study**: - By comparing variants with only alpaca tuning (AT) and only recommendation tuning (RT), the importance of recommendation tuning was verified. - Results showed that alpaca tuning can enhance the generalization ability of LLMs on new tasks, especially when training data is insufficient. 3. **Cross-domain Generalization Analysis**: - TALLRec performed well in recommendation tasks across different domains, demonstrating good cross-domain generalization capabilities. - Joint tuning with data from different domains further improved the performance of TALLRec. ### Main Contributions 1. **Revealed the limitations of in-context learning methods**, emphasizing the importance of instruction tuning in aligning LLMs with recommendation tasks. 2. **Proposed the TALLRec framework**, which can effectively and efficiently tune LLMs with low GPU resource consumption and a small number of tuning samples. 3. **Validated the effectiveness and efficiency of the framework through extensive experiments**, showing excellent performance in cross-domain recommendation tasks. In summary, the paper successfully addresses the application issues of LLMs in recommendation systems through the TALLRec framework, providing new insights for the future development of recommendation systems.