Abstract:Nowadays, interactive recommendation systems (IRS) play a significant role in our daily life. Recently, reinforcement learning has shown great potential in solving challenging tasks in IRS, since it can focus on long-term profit and can capture the dynamic preference of users. However, existing RL methods for IRS have two typical deficiencies. First, most state representation models use left-to-right recurrent neural networks to capture the user dynamics, which usually fail to handle the long and noisy sequential data in real life. Second, an IRS always needs to handle millions of items, leading to a large discrete action space in RL settings, which has not been fully addressed by the inefficient existing works. To bridge these deficiencies, in this paper, we propose attention-based tree recommendation (ATRec), an efficient tree-structured policy with attention-based state representation for IRS. ATRec uses an attention-based state representation model to effectively capture the user's dynamic preference hidden in the long and noisy sequence of behaviors. Moreover, to improve the learning efficiency, we propose an efficient tree-structured policy representation method, in which a complete tree is devised to represent the policy, and a novel parameter-sharing strategy is introduced. Extensive experiments are conducted on three real-world datasets and the results show the proposed ATRec obtains 42.3% improvement over some of the state of the arts methods in the hit rate and 21.4% improvement in the mean reciprocal rank of the top 30 ranked items. Additionally, the learning and decision efficiency can also be improved at an average of 35.5%.

Efficient Interactive Recommendation via Huffman Tree-based Policy Learning

Efficient Tree Policy with Attention-Based State Representation for Interactive Recommendation

Large-scale Interactive Recommendation with Tree-structured Policy Gradient

Interactive Recommender System via Knowledge Graph-enhanced Reinforcement Learning

Balancing Accuracy and Fairness for Interactive Recommendation with Reinforcement Learning

Sim-to-Real Interactive Recommendation via Off-Dynamics Reinforcement Learning

Unified Conversational Recommendation Policy Learning via Graph-based Reinforcement Learning

A Deep Reinforcement Learning Recommender System With Multiple Policies for Recommendations

RECENT ADVANCES IN PERSONAL RECOMMENDER SYSTEMS

UISA: User Information Separating Architecture for Commodity Recommendation Policy with Deep Reinforcement Learning

Session-based Interactive Recommendation Via Deep Reinforcement Learning

Knowledge-Enhanced Causal Reinforcement Learning Model for Interactive Recommendation

Influential Recommender System

Improving Long-Term Metrics in Recommendation Systems using Short-Horizon Reinforcement Learning

Deep Reinforcement Learning based Recommendation with Explicit User-Item Interactions Modeling

Hierarchical Reinforcement Learning for Temporal Abstraction of Listwise Recommendation

Developing Multi-Task Recommendations with Long-Term Rewards via Policy Distilled Reinforcement Learning

A Knowledge Graph-based Interactive Recommender System Using Reinforcement Learning

Conservative Q-Improvement: Reinforcement Learning for an Interpretable Decision-Tree Policy

Towards Hierarchical Policy Learning for Conversational Recommendation with Hypergraph-based Reinforcement Learning

Interpretable and Editable Programmatic Tree Policies for Reinforcement Learning