Data Augmented Sequential Recommendation based on Counterfactual Thinking
Xu Chen,Zhenlei Wang,Hongteng Xu,Jingsen Zhang,Yongfeng Zhang,Wayne Xin Zhao,Ji-Rong Wen
DOI: https://doi.org/10.1109/tkde.2022.3222070
IF: 9.235
2022-01-01
IEEE Transactions on Knowledge and Data Engineering
Abstract:Sequential recommendation has recently attracted increasing attention from the industry and academic communities. While previous models have achieved remarkable successes, an important problem may still hinder their performances, that is, the sparsity of the real-world data. In this paper, we propose a novel counterfactual data augmentation framework to alleviate the problem of data sparsity. In specific, our framework contains a sampler model and an anchor model. The sampler model aims to generate high-quality user behavior sequences, while the anchor model is trained based on the original and new generated samples, and leveraged to provide the final recommendation list. To implement the sampler model, we first design four types of heuristic methods based on either random or frequency-based strategies. And then, to improve the quality of the generated sequences, we propose two learning-based samplers by discovering the decision boundaries or increasing the sample informativeness. At last, we build an RL based model to automatically determine where to edit the history behaviors and how many items should be replaced. Considering that the sampler model can be imperfect, we, at last, analyze the influence of the noisy information contained in the generated sequences on the anchor model in theory, and design a simple but effective method to better serve the anchor model. We conduct extensive experiments to demonstrate the effectiveness of our model.
computer science, information systems, artificial intelligence,engineering, electrical & electronic