Understanding the Role of Cross-Entropy Loss in Fairly Evaluating Large Language Model-based Recommendation

Cong Xu, Zhangchi Zhu, Jun Wang, Jianyong Wang, Wei Zhang
DOI: https://doi.org/10.48550/arxiv.2402.06216
2024-01-01
Abstract: Large language models (LLMs) have gained much attention in the recommendation community; some studies have observed that LLMs, fine-tuned by the cross-entropy loss with a full softmax, could achieve state-of-the-art performance already. However, these claims are drawn from unobjective and unfair comparisons. In view of the substantial quantity of items in reality, conventional recommenders typically adopt a pointwise/pairwise loss function instead for training. This substitute however causes severe performance degradation, leading to under-estimation of conventional methods and over-confidence in the ranking capability of LLMs. In this work, we theoretically justify the superiority of cross-entropy, and showcase that it can be adequately replaced by some elementary approximations with certain necessary modifications. The remarkable results across three public datasets corroborate that even in a practical sense, existing LLM-based methods are not as effective as claimed for next-item recommendation. We hope that these theoretical understandings in conjunction with the empirical results will facilitate an objective evaluation of LLM-based recommendation in the future.
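To make the loss functions named in the abstract concrete, the following is a minimal sketch (not the paper's code) of cross-entropy with a full softmax alongside common pointwise (binary cross-entropy), pairwise (BPR), and sampled-softmax forms. The function names, the single-negative setup, and the toy scores are assumptions chosen for illustration; the "necessary modifications" the abstract refers to are not reproduced here.

```python
import numpy as np

def full_softmax_ce(scores, target):
    """Cross-entropy over the whole item catalogue: -log softmax(scores)[target]."""
    shifted = scores - scores.max()            # stabilise the exponentials
    log_z = np.log(np.exp(shifted).sum())      # log of the partition function
    return float(log_z - shifted[target])

def bce_pointwise(scores, target, negative):
    """Pointwise binary cross-entropy with one sampled negative (illustrative setup)."""
    pos = np.logaddexp(0.0, -scores[target])   # -log sigmoid(s_pos)
    neg = np.logaddexp(0.0, scores[negative])  # -log(1 - sigmoid(s_neg))
    return float(pos + neg)

def bpr_pairwise(scores, target, negative):
    """Pairwise BPR loss: -log sigmoid(s_pos - s_neg)."""
    return float(np.logaddexp(0.0, -(scores[target] - scores[negative])))

def sampled_softmax_ce(scores, target, negatives):
    """Softmax cross-entropy restricted to the target plus sampled negatives.

    Plain, uncorrected version: no sampling-bias correction is applied.
    """
    idx = np.concatenate(([target], negatives))
    return full_softmax_ce(scores[idx], 0)     # target sits at position 0

# Hypothetical scores of one user over a five-item catalogue; item 0 is the next item.
scores = np.array([2.0, 0.5, -1.0, 0.0, 1.5])
target = 0
print(full_softmax_ce(scores, target))
print(bce_pointwise(scores, target, negative=4))
print(bpr_pairwise(scores, target, negative=4))
print(sampled_softmax_ce(scores, target, negatives=np.array([1, 4])))
```

In this sketch the full softmax normalises over every item, whereas the pointwise and pairwise losses see only the sampled negative, and the sampled softmax sees only a subset of the catalogue. With all non-target items used as negatives, the sampled form coincides with the full cross-entropy.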