Abstract: Recommender systems increasingly shape our decisions on the Internet, and recommendation is a typical process of continuous interaction between a system and its users. Most previous recommender systems focus heavily on optimizing recommendation accuracy while neglecting other important aspects of recommendation quality, such as the diversity of the recommendation list. In this study, we propose a novel framework for optimizing the recommendation list in the Top-N task, named Collaborative Filtering-based Deep Reinforcement Learning (CFDRL), which promotes the diversity of recommendation results without sacrificing recommendation accuracy. More specifically, to capture the continuous user-item interaction effectively, we adopt deep reinforcement learning (DRL) to update the recommendation strategy dynamically according to the user’s real-time feedback. Meanwhile, to generate diverse and complementary items for recommendation, we design a diversity-aware reward function whose maximization trades off diversity against accuracy. In addition, to alleviate a drawback of DQN, which directly picks the recommendations with the highest Q-values from the unselected items, we define a modified ε-greedy exploration policy that is combined with a collaborative filtering (CF) model. The policy first uses the CF model to sort the items and divide them into two parts according to item similarity; then, with a given probability, the agent selects from these parts to generate an action list. Experimental results on two real-world e-commerce datasets demonstrate the effectiveness of the proposed model.
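
The abstract describes the modified ε-greedy exploration policy only at a high level, so the following Python sketch is one possible reading of it, not the authors' implementation. It assumes that the CF model supplies a similarity score per item, that the sorted items are split by a fixed ratio, that exploitation takes the highest Q-value from the more similar part, and that exploration draws at random from the less similar part. The names `cf_split`, `select_action_list`, `split_ratio`, and the score dictionaries are hypothetical.

```python
import random


def cf_split(candidates, cf_scores, split_ratio=0.5):
    """Sort items by CF similarity (descending) and split them into two parts.

    Assumption: a fixed split ratio; the abstract does not say how the split is made.
    """
    ranked = sorted(candidates, key=lambda item: cf_scores[item], reverse=True)
    cut = max(1, int(len(ranked) * split_ratio))
    return ranked[:cut], ranked[cut:]


def select_action_list(candidates, q_values, cf_scores, n, epsilon,
                       rng=None, split_ratio=0.5):
    """Build an action list of up to n distinct items.

    With probability epsilon the next item is drawn at random from the
    less similar part (exploration); otherwise the item with the highest
    Q-value in the more similar part is taken (exploitation). If one part
    runs out, the other part is used so the list can still be filled.
    """
    rng = rng or random.Random()
    similar, dissimilar = cf_split(candidates, cf_scores, split_ratio)
    # Keep the similar part ordered by Q-value so that pop(0) is the greedy choice.
    similar = sorted(similar, key=lambda item: q_values[item], reverse=True)
    dissimilar = list(dissimilar)

    actions = []
    while len(actions) < n and (similar or dissimilar):
        explore = rng.random() < epsilon
        if (explore and dissimilar) or not similar:
            actions.append(dissimilar.pop(rng.randrange(len(dissimilar))))
        else:
            actions.append(similar.pop(0))
    return actions


if __name__ == "__main__":
    # Toy scores, invented for illustration only.
    items = ["a", "b", "c", "d"]
    cf = {"a": 0.9, "b": 0.8, "c": 0.2, "d": 0.1}
    q = {"a": 0.1, "b": 0.5, "c": 0.9, "d": 0.3}
    print(select_action_list(items, q, cf, n=3, epsilon=0.2, rng=random.Random(0)))
```

Under these assumptions, `epsilon` controls how often the list is extended with items outside the CF model's most similar set, which is one way exploration could contribute to diversity. The actual split rule and selection probabilities would have to be taken from the full paper.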

Stabilizing Reinforcement Learning in Dynamic Environment with Application to Online Recommendation

Exploiting Structural and Temporal Influence for Dynamic Social-Aware Recommendation

A stable deep reinforcement learning framework for recommendation

Pseudo Dyna-Q

Model-free Reinforcement Learning with Stochastic Reward Stabilization for Recommender Systems

Deep Reinforcement Learning for List-wise Recommendations

Sim-to-Real Interactive Recommendation via Off-Dynamics Reinforcement Learning

A Deep Reinforcement Learning Real-Time Recommendation Model Based on Long and Short-Term Preference

Relieving Popularity Bias in Interactive Recommendation: A Diversity-Novelty-Aware Reinforcement Learning Approach

Reinforcing User Retention in a Billion Scale Short Video Recommender System

User Retention-oriented Recommendation with Decision Transformer

Dynamic Online Recommendation for Two-Sided Market with Bayesian Incentive Compatibility

Q-ADER: An Effective Q-Learning for Recommendation With Diminishing Action Space

Diversity-Promoting Deep Reinforcement Learning for Interactive Recommendation

Optimizing Long-term Value for Auction-Based Recommender Systems via On-Policy Reinforcement Learning

Efficient Deep Reinforcement Learning-Enabled Recommendation

A General Offline Reinforcement Learning Framework for Interactive Recommendation

DEAR: Deep Reinforcement Learning for Online Advertising Impression in Recommender Systems

Deep Reinforcement Learning based Recommendation with Explicit User-Item Interactions Modeling

Top-aware reinforcement learning based recommendation

Diversity-Aware Top-N Recommendation: A Deep Reinforcement Learning Way