Abstract:Recently, contextual multiarmed bandits (CMAB)-based recommendation has shown promise for applications in dynamic domains such as news or short video recommendation, where items are changing over time. There are two key challenges in constructing a CMAB-based system. First, a user’s historical rating data are usually sparse, which restricts the precise representation of dynamic contextual information. Second, it is difficult to design a personalized recommendation policy because of the high diversity of users’ selection behaviors over time. Therefore, existing CMAB-based recommendation methods mainly focus on the improvement of recommendation accuracy. This leads to difficulty in capturing the dynamic personalized preferences of users. To overcome these limitations, we design a novel knowledge-enhanced contextual-bandit approach that combines the dynamic recommendation strategy of the bandit algorithm and the knowledge representation of the knowledge graph. We argue that a user’s selection history for items reflects his/her favorite attributes, which will be repeated in the future. Accordingly we propose the knowledge-enhanced CMAB model, which leverages the contextual information of both users and items obtained by knowledge graph-based embedding. We validate the performance of the proposed model using two public datasets. Experimental results show that our method outperforms state-of-the-art methods in terms of both recommendation accuracy and diversity. • We combine a knowledge graph and the bandit method for constructing personalized, dynamic recommendations. • We propose an intention-selection mechanism for item selection in recommender systems. • We propose the one choice multi change strategy for a contextual multiarmed bandit. • Knowledge graphs effectively address the lack of knowledge in recommender systems.

Interactive Recommendation Via Deep Neural Memory Augmented Contextual Bandits

Neural Contextual Bandits for Personalized Recommendation

Sequential Modeling of Hierarchical User Intention and Preference for Next-item Recommendation

Two-Stage Neural Contextual Bandits for Personalised News Recommendation

Diverse Interaction Recommendation for Public Users Exploring Multi-view Visualization Using Deep Learning

Exploiting Structural and Temporal Influence for Dynamic Social-Aware Recommendation

A Knowledge-Enhanced Contextual Bandit Approach for Personalized Recommendation in Dynamic Domains

HyperBandit: Contextual Bandit with Hypernewtork for Time-Varying User Preferences in Streaming Recommendation

Gated Sequential Recommendation with Dynamic Memory Network

Adaptive User Modeling with Long and Short-Term Preferences for Personalized Recommendation.

A Contextual-Bandit Approach to Personalized News Article Recommendation

Bandit Learning for Diversified Interactive Recommendation

Sequential Recommendation with User Memory Networks

Latent Contextual Bandits and their Application to Personalized Recommendations for New Users

Contextual Bandits With Hidden Features to Online Recommendation via Sparse Interactions

DeepInteract: Multi-view Features Interactive Learning for Sequential Recommendation

Deep Adaptive Interest Network: Personalized Recommendation with Context-Aware Learning

Neural Interactive Collaborative Filtering

Signed Distance-based Deep Memory Recommender

Interactive Social Recommendation

An Attention-Based Recommender System to Predict Contextual Intent Based on Choice Histories Across and Within Sessions