Abstract:Recommender system usually faces popularity bias. From the popularity distribution shift perspective, the normal paradigm trained on exposed items (most are hot items) identifies that recommending popular items more frequently can achieve lower loss, thus injecting popularity information into item property embedding, e.g., id embedding. From the long-tail distribution shift perspective, the sparse interactions of long-tail items lead to insufficient learning of them. The resultant distribution discrepancy between hot and long-tail items would not only inherit the bias, but also amplify the bias. Existing work addresses this issue with inverse propensity scoring (IPS) or causal embeddings. However, we argue that not all popularity biases mean bad effects, i.e., some items show higher popularity due to better quality or conform to current trends, which deserve more recommendations. Blindly seeking unbiased learning may inhibit high-quality or fashionable items. To make better use of the popularity bias, we propose a co-training disentangled domain adaptation network (CD$^2$AN), which can co-train both biased and unbiased models. Specifically, for popularity distribution shift, CD$^2$AN disentangles item property representation and popularity representation from item property embedding. For long-tail distribution shift, we introduce additional unexposed items (most are long-tail items) to align the distribution of hot and long-tail item property representations. Further, from the instances perspective, we carefully design the item similarity regularization to learn comprehensive item representation, which encourages item pairs with more effective co-occurrences patterns to have more similar item property representations. Based on offline evaluations and online A/B tests, we show that CD$^2$AN outperforms the existing debiased solutions. Currently, CD$^2$AN has been successfully deployed at Mobile Taobao App and handling major online traffic.

Uncovering the Propensity Identification Problem in Debiased Recommendations

Removing Hidden Confounding in Recommendation: A Unified Multi-Task Learning Approach

Propensity Matters: Measuring and Enhancing Balancing for Recommendation.

Balancing Unobserved Confounding with a Few Unbiased Ratings in Debiased Recommendations

Co-training Disentangled Domain Adaptation Network for Leveraging Popularity Bias in Recommenders

Cross Pairwise Ranking for Unbiased Item Recommendation

Combating Selection Biases in Recommender Systems with a Few Unbiased Ratings

Relaxing the Accurate Imputation Assumption in Doubly Robust Learning for Debiased Collaborative Filtering

Addressing Unmeasured Confounder for Recommendation with Sensitivity Analysis

De-Selection Bias Recommendation Algorithm Based on Propensity Score Estimation

Debiased Recommendation with Noisy Feedback

Debiased Recommendation with User Feature Balancing

Doubly Calibrated Estimator for Recommendation on Data Missing Not At Random

MDI: A Debiasing Method Combining Unbiased and Biased Data.

Be Causal: De-Biasing Social Network Confounding in Recommendation

Unbiased Sequential Recommendation with Latent Confounders

ReCRec: Reasoning the Causes of Implicit Feedback for Debiased Recommendation

Unbiased Learning to Rank with Unbiased Propensity Estimation

Doubly Robust Joint Learning for Recommendation on Data Missing Not at Random

Correcting the User Feedback-Loop Bias for Recommendation Systems

Unbiased Recommender Learning from Missing-Not-At-Random Implicit Feedback