Abstract: Recommender systems learn personalized user preferences from user feedback like clicks. However, user feedback is usually biased towards partially observed interests, leaving many users' hidden interests unexplored. Existing approaches typically mitigate the bias, increase recommendation diversity, or use bandit algorithms to balance exploration-exploitation trade-offs. Nevertheless, they fail to consider the potential rewards of recommending different categories of items and lack the global scheduling of allocating top-N recommendations to categories, leading to suboptimal exploration. In this work, we propose an Uplift model-based Recommender (UpliftRec) framework, which regards top-N recommendation as a treatment optimization problem. UpliftRec estimates the treatment effects, i.e., the click-through rate (CTR) under different category exposure ratios, by using observational user feedback. UpliftRec calculates group-level treatment effects to discover users' hidden interests with high CTR rewards and leverages inverse propensity weighting to alleviate confounder bias. Thereafter, UpliftRec adopts a dynamic programming method to calculate the optimal treatment for overall CTR maximization. We implement UpliftRec on different backend models and conduct extensive experiments on three datasets. The empirical results validate the effectiveness of UpliftRec in discovering users' hidden interests while achieving superior recommendation accuracy.

What problem does this paper attempt to address?

This paper addresses the problem of user-interest exploration in recommender systems. Existing recommender systems usually learn personalized user preferences from user feedback (such as clicks), but this feedback is biased towards the interests that have already been observed, so many of a user's hidden interests are never explored. This limits recommendation diversity and can also lead to filter bubbles and echo-chamber effects. To address this, the paper proposes an uplift model-based recommendation framework (UpliftRec), which treats top-\( N \) recommendation as a treatment-optimization problem. UpliftRec uses observed user feedback to estimate the click-through rate (CTR) under different category-exposure ratios, and calculates treatment effects at the group level to discover hidden interests with high CTR rewards. In addition, UpliftRec uses inverse propensity weighting (IPW) to reduce confounding bias and adopts a dynamic-programming method to calculate the optimal treatment for overall CTR maximization.

### Main Contributions

1. **User-Interest Exploration Task from a Causal Perspective**: The paper reformulates the user-interest exploration task from a causal perspective, emphasizing the importance of estimating user treatment effects across multiple item categories.
2. **Novel UpliftRec Framework**: The framework addresses several challenges in treatment-effect estimation, aiming at reliable effect estimates and optimal treatment selection.
3. **Experimental Verification**: Extensive experiments on three datasets verify the effectiveness of the proposed method in exploring new interests while improving CTR performance.

### Method Overview

1. **Treatment-Effect Estimation**:
   - **Enhanced Dataset Generation**: Generate samples for effect estimation from users' observed interaction histories.
   - **Confounding-Factor Debiasing**: Use inverse propensity weighting (IPW) to reduce the influence of confounding factors.
   - **ADRF Calculation**: Calculate the average dose-response function (ADRF) for each category through discretization.
2. **Optimal Treatment Selection**:
   - **Dynamic Programming**: Transform the optimization problem into a knapsack problem and use a dynamic-programming algorithm to find the optimal allocation of exposure slots to categories.
   - **MTEF Approximation**: To address the high variance of the ADRF solution, a simplified MTEF estimate is proposed, which alleviates the impact of sparse data through gradient approximation.

### Experimental Results

The paper reports extensive experiments on three real-world datasets to evaluate UpliftRec. The results show that UpliftRec improves recommendation accuracy while exploring users' hidden interests.

### Evaluation Metrics

- **Accuracy Metrics**: Recall@K and NDCG@K.
- **Exploration-Ability Metrics**:
  - **RUE@K**: Measures the recall of unexpected items in the top-\( K \) recommendation list, where unexpected items are those whose categories are not among the top 3 categories of the user's interaction history.
  - **RUP@K**: Measures the recall of unpopular items in the top-\( K \) recommendation list, where unpopular items are those whose interaction counts in the training set are in the bottom 90%.

Together, these methods and experiments address the problem of exploring users' hidden interests in recommender systems.
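The two stages described in the method overview can be illustrated with a minimal sketch. It is not the paper's implementation: the function names `ipw_binned_ctr` and `allocate_slots`, the self-normalized IPW estimator, the equal-width binning of exposure ratios, and the assumption that the total reward is the sum of per-category rewards are all illustrative choices made here. The first function estimates a discretized dose-response curve for one category from `(exposure_ratio, clicked, propensity)` samples; the second solves the resulting slot-allocation problem with a knapsack-style dynamic program.

```python
from typing import List, Optional, Sequence, Tuple


def ipw_binned_ctr(
    samples: Sequence[Tuple[float, int, float]], num_bins: int
) -> List[Optional[float]]:
    """Self-normalized IPW estimate of CTR per exposure-ratio bin.

    Each sample is (exposure_ratio in [0, 1], clicked in {0, 1}, propensity > 0).
    The propensity is assumed to be given; estimating it is out of scope here.
    Returns None for bins that received no samples.
    """
    weighted_clicks = [0.0] * num_bins
    weight_sum = [0.0] * num_bins
    for ratio, clicked, propensity in samples:
        # Equal-width discretization; a ratio of exactly 1.0 goes to the last bin.
        b = min(int(ratio * num_bins), num_bins - 1)
        w = 1.0 / propensity
        weighted_clicks[b] += w * clicked
        weight_sum[b] += w
    return [c / s if s > 0 else None for c, s in zip(weighted_clicks, weight_sum)]


def allocate_slots(
    reward: Sequence[Sequence[float]], n_slots: int
) -> Tuple[float, List[int]]:
    """Allocate exactly n_slots recommendation slots across categories.

    reward[c][k] is the estimated reward of giving k slots to category c,
    for k = 0..n_slots. Total reward is assumed additive across categories.
    Returns (best total reward, slots per category).
    """
    num_cats = len(reward)
    neg_inf = float("-inf")
    # best[c][n]: max reward using the first c categories and exactly n slots.
    best = [[neg_inf] * (n_slots + 1) for _ in range(num_cats + 1)]
    choice = [[0] * (n_slots + 1) for _ in range(num_cats + 1)]
    best[0][0] = 0.0
    for c in range(1, num_cats + 1):
        for n in range(n_slots + 1):
            for k in range(n + 1):  # slots given to category c - 1
                prev = best[c - 1][n - k]
                if prev == neg_inf:
                    continue
                cand = prev + reward[c - 1][k]
                if cand > best[c][n]:
                    best[c][n] = cand
                    choice[c][n] = k
    # Backtrack from the full budget to recover the allocation.
    alloc = [0] * num_cats
    n = n_slots
    for c in range(num_cats, 0, -1):
        alloc[c - 1] = choice[c][n]
        n -= alloc[c - 1]
    return best[num_cats][n_slots], alloc


if __name__ == "__main__":
    # Hypothetical reward table: 3 categories, top-3 list.
    table = [
        [0.0, 0.5, 0.8, 0.9],
        [0.0, 0.4, 0.9, 1.0],
        [0.0, 0.3, 0.35, 0.4],
    ]
    print(allocate_slots(table, 3))
```

In this toy table the third category never wins a slot because its marginal gains are the smallest. The dynamic program runs in time proportional to the number of categories times the square of the list length, which is small for typical top-N sizes.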

Treatment Effect Estimation for User Interest Exploration on Recommender Systems

User-Aware Multi-Interest Learning for Candidate Matching in Recommenders

Uplift Modeling for Multiple Treatments with Cost Optimization

Uplift Modeling for Target User Attacks on Recommender Systems

Uplift Modeling with Multiple Treatments and General Response Types

Enhancing Uplift Modeling in Multi-Treatment Marketing Campaigns: Leveraging Score Ranking and Calibration Techniques

Understanding and Counteracting Feature-Level Bias in Click-Through Rate Prediction

Feature Selection Methods for Uplift Modeling and Heterogeneous Treatment Effect

Entire Chain Uplift Modeling with Context-Enhanced Learning for Intelligent Marketing

Hierarchical Interest Modeling of Long-tailed Users for Click-Through Rate Prediction.

Learning Discriminative Representation Base on Attention for Uplift

Estimating Treatment Effects under Recommender Interference: A Structured Neural Networks Approach

Soft Retargeting Network for Click Through Rate Prediction

Coarse-to-fine Dynamic Uplift Modeling for Real-time Video Recommendation

Imbalance-Aware Uplift Modeling for Observational Data

Exploiting User Interests for Collaborative Filtering

Data-driven smoothing approaches for interest modeling in recommendation systems

Enhancing collaborative filtering by user interest expansion via personalized ranking

Who Should Be Given Incentives? Counterfactual Optimal Treatment Regimes Learning for Recommendation

Deep Interest Highlight Network for Click-Through Rate Prediction in Trigger-Induced Recommendation.

Give Me Something Unknown: Incorporate Exploration Preference In Cognition Into Recommender System