Treatment Effect Estimation for User Interest Exploration on Recommender Systems

Jiaju Chen, Wenjie Wang, Chongming Gao, Peng Wu, Jianxiong Wei, Qingsong Hua
DOI: https://doi.org/10.1145/3626772.3657736
2024-05-14
Abstract: Recommender systems learn personalized user preferences from user feedback like clicks. However, user feedback is usually biased towards partially observed interests, leaving many users' hidden interests unexplored. Existing approaches typically mitigate the bias, increase recommendation diversity, or use bandit algorithms to balance exploration-exploitation trade-offs. Nevertheless, they fail to consider the potential rewards of recommending different categories of items and lack the global scheduling of allocating top-N recommendations to categories, leading to suboptimal exploration. In this work, we propose an Uplift model-based Recommender (UpliftRec) framework, which regards top-N recommendation as a treatment optimization problem. UpliftRec estimates the treatment effects, i.e., the click-through rate (CTR) under different category exposure ratios, by using observational user feedback. UpliftRec calculates group-level treatment effects to discover users' hidden interests with high CTR rewards and leverages inverse propensity weighting to alleviate confounder bias. Thereafter, UpliftRec adopts a dynamic programming method to calculate the optimal treatment for overall CTR maximization. We implement UpliftRec on different backend models and conduct extensive experiments on three datasets. The empirical results validate the effectiveness of UpliftRec in discovering users' hidden interests while achieving superior recommendation accuracy.
Information Retrieval
What problem does this paper attempt to address?
This paper addresses user-interest exploration in recommender systems. Existing recommender systems usually learn personalized preferences from user feedback such as clicks, but this feedback is biased towards the interests that have already been observed, so many users' hidden interests are never explored. This limits recommendation diversity and can also lead to filter bubbles and echo-chamber effects. To address this, the paper proposes an uplift-model-based recommendation framework (UpliftRec), which treats top-\( N \) recommendation as a treatment-optimization problem. UpliftRec uses observed user feedback to estimate the click-through rate (CTR) under different category-exposure ratios and calculates group-level treatment effects to discover hidden interests with high CTR rewards. In addition, UpliftRec uses inverse propensity weighting (IPW) to reduce confounding bias and adopts a dynamic-programming method to calculate the treatment that maximizes overall CTR.

### Main Contributions

1. **User-Interest Exploration Task from a Causal Perspective**: The paper reformulates the user-interest exploration task from a causal perspective, emphasizing the importance of estimating each user's treatment effects across multiple item categories.
2. **Novel UpliftRec Framework**: The framework addresses several challenges in treatment-effect estimation, giving reliable effect estimates and optimal treatment selection.
3. **Experimental Verification**: Extensive experiments on three datasets verify that the proposed method explores new interests while improving CTR performance.

### Method Overview

1. **Treatment-Effect Estimation**:
   - **Enhanced Dataset Generation**: Generate samples for effect estimation from users' observed interaction histories.
   - **Confounding-Factor Debiasing**: Use inverse propensity weighting (IPW) to reduce the influence of confounding factors.
   - **ADRF Calculation**: Calculate the average dose-response function (ADRF) for each category through discretization.
2. **Optimal Treatment Selection**:
   - **Dynamic Programming**: Transform the optimization problem into a knapsack problem and use a dynamic-programming algorithm to find the optimal allocation of exposure resources across categories.
   - **MTEF Approximation**: To address the high variance of the ADRF solution, the paper proposes a simplified MTEF estimate, which uses gradient approximation to alleviate the impact of sparse data.

### Experimental Results

The paper reports extensive experiments on three real-world datasets to evaluate UpliftRec. The results show that UpliftRec significantly improves recommendation accuracy while exploring users' hidden interests.

### Evaluation Metrics

- **Accuracy Metrics**: Recall@K and NDCG@K.
- **Exploration-Ability Metrics**:
  - **RUE@K**: Measures the recall of unexpected items in the top-\( K \) recommendation list, where unexpected items are those whose categories are not among the top 3 categories in the user's interaction history.
  - **RUP@K**: Measures the recall of unpopular items in the top-\( K \) recommendation list, where unpopular items are those whose interaction counts in the training set fall in the bottom 90%.

Taken together, the methods and experiments indicate that UpliftRec can surface users' hidden interests while maintaining, and in the reported results improving, recommendation accuracy.
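
The two method steps summarized above, IPW-debiased ADRF estimation over discretized exposure levels and knapsack-style dynamic programming over categories, can be illustrated with a minimal Python sketch. This is not the authors' implementation. It assumes that the overall reward is the sum of per-category rewards, that propensities are already given, and that the treatment is discretized as a number of slots out of \( N \). The names `ipw_adrf` and `allocate_slots` and the sample layout `(slots, clicks, propensity)` are hypothetical, and the estimator shown is the self-normalized variant of IPW.

```python
from typing import List, Sequence, Tuple


def ipw_adrf(samples: Sequence[Tuple[int, float, float]], n_slots: int) -> List[float]:
    """Self-normalized IPW estimate of a discretized dose-response curve.

    Each sample is (slots, clicks, propensity) for ONE category (assumed layout):
    slots      -- how many of the N slots the category received (the "dose")
    clicks     -- observed clicks on that category's items
    propensity -- probability of receiving that dose (assumed given, > 0)
    Returns adrf[k] for k = 0..n_slots; levels with no data default to 0.0.
    """
    num = [0.0] * (n_slots + 1)
    den = [0.0] * (n_slots + 1)
    for slots, clicks, propensity in samples:
        weight = 1.0 / propensity          # inverse propensity weight
        num[slots] += weight * clicks
        den[slots] += weight
    return [n / d if d > 0 else 0.0 for n, d in zip(num, den)]


def allocate_slots(reward: Sequence[Sequence[float]], n_slots: int) -> Tuple[List[int], float]:
    """Grouped-knapsack DP: choose slots per category, using exactly n_slots in total.

    reward[c][k] is the estimated reward of giving category c exactly k slots;
    every reward[c] must have n_slots + 1 entries.
    Returns (allocation per category, total estimated reward).
    """
    neg_inf = float("-inf")
    # best[b]: best total reward using exactly b slots over the categories seen so far
    best = [0.0] + [neg_inf] * n_slots
    choices: List[List[int]] = []
    for cat_reward in reward:
        new_best = [neg_inf] * (n_slots + 1)
        pick = [0] * (n_slots + 1)
        for budget in range(n_slots + 1):
            for k in range(budget + 1):
                if best[budget - k] == neg_inf:
                    continue               # remaining budget is unreachable
                value = best[budget - k] + cat_reward[k]
                if value > new_best[budget]:
                    new_best[budget] = value
                    pick[budget] = k
        best = new_best
        choices.append(pick)

    # Backtrack from the full budget to recover each category's slot count.
    allocation = [0] * len(reward)
    budget = n_slots
    for c in range(len(reward) - 1, -1, -1):
        allocation[c] = choices[c][budget]
        budget -= allocation[c]
    return allocation, best[n_slots]
```

As a usage example, with three categories, three slots, and the made-up reward tables `[[0.0, 0.5, 0.6, 0.65], [0.0, 0.3, 0.55, 0.7], [0.0, 0.2, 0.25, 0.27]]`, `allocate_slots` assigns one slot to the first category, two to the second, and none to the third. Diminishing returns in the first category make it worthwhile to shift exposure to the second, which is the kind of global scheduling across categories the summary describes.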