Pre-trained Recommender Systems: A Causal Debiasing Perspective

Ziqian Lin,Hao Ding,Nghia Trong Hoang,Branislav Kveton,Anoop Deoras,Hao Wang
2024-01-09
Abstract:Recent studies on pre-trained vision/language models have demonstrated the practical benefit of a new, promising solution-building paradigm in AI where models can be pre-trained on broad data describing a generic task space and then adapted successfully to solve a wide range of downstream tasks, even when training data is severely limited (e.g., in zero- or few-shot learning scenarios). Inspired by such progress, we investigate in this paper the possibilities and challenges of adapting such a paradigm to the context of recommender systems, which is less investigated from the perspective of pre-trained model. In particular, we propose to develop a generic recommender that captures universal interaction patterns by training on generic user-item interaction data extracted from different domains, which can then be fast adapted to improve few-shot learning performance in unseen new domains (with limited data). However, unlike vision/language data which share strong conformity in the semantic space, universal patterns underlying recommendation data collected across different domains (e.g., different countries or different E-commerce platforms) are often occluded by both in-domain and cross-domain biases implicitly imposed by the cultural differences in their user and item bases, as well as their uses of different e-commerce platforms. As shown in our experiments, such heterogeneous biases in the data tend to hinder the effectiveness of the pre-trained model. To address this challenge, we further introduce and formalize a causal debiasing perspective, which is substantiated via a hierarchical Bayesian deep learning model, named PreRec. Our empirical studies on real-world data show that the proposed model could significantly improve the recommendation performance in zero- and few-shot learning settings under both cross-market and cross-platform scenarios.
Information Retrieval,Artificial Intelligence
What problem does this paper attempt to address?
The main problem that this paper attempts to solve is the data scarcity problem in recommendation systems, especially in zero - shot learning and few - shot learning in cross - market and cross - platform scenarios, and how to effectively utilize pre - trained models to improve recommendation performance. Specifically, the paper focuses on two key challenges: 1. **Cross - Domain Bias**: There are significant differences in user and item characteristics between different markets or platforms, and these differences lead to cross - domain bias. For example, users in different countries may have different preferences for the same type of product, or the price and promotion strategies of the same product in different markets are different. 2. **In - Domain Bias**: Even within the same market or platform, there are also some biases, such as popularity bias, that is, users tend to interact with popular items, which will affect the user interaction patterns and the performance of the recommendation system. To address these challenges, the paper proposes a new Bayesian deep - learning framework named **PreRec**, which has a causal de - biasing mechanism. The main contributions of PreRec include: - **Identifying Biases**: The paper first identifies the in - domain biases and cross - domain biases existing in multi - domain data, which may weaken the generalization ability of pre - trained recommendation systems. - **Designing the Model**: A new Bayesian deep - learning model PreRec is proposed, which can extract general knowledge from multi - domain data and eliminate biases through causal intervention. - **Experimental Verification**: Extensive experiments were carried out on multiple datasets to evaluate the performance of PreRec in zero - sample, incremental training, and fine - tuning settings. The results show that PreRec can significantly improve the recommendation performance in cross - market and cross - platform scenarios. ### Model Overview The core idea of the PreRec model is to consider in - domain biases and cross - domain biases during the pre - training stage and eliminate these biases through causal intervention during the inference stage. Specifically: - **Multi - Domain Pre - training**: PreRec is pre - trained on data from multiple source domains while considering cross - domain biases and in - domain biases. - **Zero - Sample Recommendation**: In the target domain, PreRec can make recommendations without the target domain interaction data, removing cross - domain biases and injecting in - domain biases of the target domain through causal intervention. - **Fine - Tuning**: When the interaction data of the target domain is available, PreRec can further fine - tune the model parameters, capture the biases of the target domain and adjust the recommendation performance. ### Key Technologies - **Causal De - biasing Mechanism**: By introducing explicit and implicit bias terms (such as domain attribute \(D_k\) and popularity bias \(Z_j\)), and performing causal intervention during the inference stage, cross - domain biases are removed and in - domain biases are retained. - **Hierarchical Bayesian Model**: A hierarchical Bayesian framework is used to model the latent properties of users, items, and domains, so as to better capture the cross - domain and in - domain dynamic changes. ### Experimental Results The paper conducted experiments on multiple datasets, including the XMarket dataset and the Online Retail dataset. The experimental results show that: - **Zero - Sample Performance**: PreRec significantly outperforms the baseline methods in the zero - sample setting. - **De - biasing Effect**: The proposed causal de - biasing mechanism effectively reduces in - domain biases and cross - domain biases and improves the recommendation performance. - **Fine - Tuning Performance**: When fine - tuning is allowed, the performance of PreRec further improves as the number of fine - tuning samples increases. In summary, through proposing the PreRec model, this paper successfully solves the data scarcity problem in recommendation systems and shows excellent recommendation performance in cross - market and cross - platform scenarios.