Abstract:Previous chapter Next chapter Full AccessProceedings Proceedings of the 2014 SIAM International Conference on Data Mining (SDM)Contextual Combinatorial Bandit and its Application on Diversified Online RecommendationLijing Qin, Shouyuan Chen, and Xiaoyan ZhuLijing Qin, Shouyuan Chen, and Xiaoyan Zhupp.461 - 469Chapter DOI:https://doi.org/10.1137/1.9781611973440.53PDFBibTexSections ToolsAdd to favoritesExport CitationTrack CitationsEmail SectionsAboutAbstract Recommender systems are faced with new challenges that are beyond traditional techniques. For example, most traditional techniques are based on similarity or overlap among existing data, however, there may not exist sufficient historical records for some new users to predict their preference, or users can hold diverse interest, but the similarity based methods may probably over-narrow it. To address the above challenges, we develop a principled approach called contextual combinatorial bandit in which a learning algorithm can dynamically identify diverse items that interest a new user. Specifically, each item is represented as a feature vector, and each user is represented as an unknown preference vector. On each of n rounds, the bandit algorithm sequentially selects a set of items according to the item-selection strategy that balances exploration and exploitation, and collects the user feedback on these selected items. A reward function is further designed to measure the quality (e.g. relevance or diversity) of the selected set based on observed feedback, and the goal of the algorithm is to maximize the total rewards of n rounds. The reward function only needs to satisfy two mild assumptions that is general enough to accommodate a large class of nonlinear functions. To solve this bandit problem, we provide algorithm that achieves Õ(√n) regret after playing n rounds. Experiments conducted on real-wold movie recommendation dataset demonstrate that our approach can effectively address the above challenges and hence improve the performance of recommendation task. Previous chapter Next chapter RelatedDetails Published:2014eISBN:978-1-61197-344-0 https://doi.org/10.1137/1.9781611973440Book Series Name:ProceedingsBook Code:PRDT14Book Pages:1-1086

BiUCB: A Contextual Bandit Algorithm for Cold-Start and Diversified Recommendation

Transferable Contextual Bandits with Prior Observations

HELLINGER-UCB: A novel algorithm for stochastic multi-armed bandit problem and cold start problem in recommender system

Bandit Learning for Diversified Interactive Recommendation

Conservative Contextual Combinatorial Cascading Bandit

Conversational Contextual Bandit: Algorithm and Application

Contextual Bandit Approach-based Recommendation System for Personalized Web-based Services

Cold-start Problems in Recommendation Systems via Contextual-bandit Algorithms

Contextual Combinatorial Bandit and Its Application on Diversified Online Recommendation

Achieving Counterfactual Fairness for Causal Bandit.

Context-Aware Bandits

Achieving User-Side Fairness in Contextual Bandits

Fairness-aware Bandit-based Recommendation

Con-CNAME: A Contextual Multi-armed Bandit Algorithm for Personalized Recommendations

Federated Contextual Cascading Bandits with Asynchronous Communication and Heterogeneous Users

Diversity-Preserving K-Armed Bandits, Revisited

Master-slave Deep Architecture for Top-K Multi-armed Bandits with Non-linear Bandit Feedback and Diversity Constraints

BanditMF: Multi-Armed Bandit Based Matrix Factorization Recommender System

Enhancing Recommendation Accuracy and Diversity with Box Embedding: A Universal Framework

Combinatorial Multi-Armed Bandit: General Framework and Applications.

Old Dog Learns New Tricks: Randomized UCB for Bandit Problems