Dynamic Assortment with Online Learning under Threshold Multinomial Logit Model

Wenxiang Chen, Caihua Chen, Houcai Shen, Ruxian Wang, Weili Xue
DOI: https://doi.org/10.2139/ssrn.4844426
2024-01-01
SSRN Electronic Journal
Abstract: Consumers are often overwhelmed by the extensive assortments that online retailers offer, and they exhibit boundedly rational behavior as a result. The existing literature on dynamic assortment optimization, however, has not accounted for this behavior. This motivates us to study the online assortment optimization problem under a simple and effective two-stage consider-then-choose model, the Threshold Multinomial Logit (TMNL) model. The TMNL model characterizes the endogenous formation of consumers' consideration sets through a threshold effect. This endogenous dependency captures more flexible substitution patterns than the classical MNL choice model, but it also makes online learning considerably harder. In the offline setting, we analyze the properties of the optimal assortment and propose an efficient assortment optimization algorithm that outperforms the benchmark. In the online setting, where both customer preferences and consideration set formation are unknown, we propose online learning algorithms that achieve nearly optimal regret bounds under both instance-independent and instance-dependent conditions. To the best of our knowledge, this is the first work to consider the online assortment problem with consumers' endogenous consider-then-choose behavior. Moreover, we extend our algorithm to the contextual learning setting, which effectively mitigates the impact of the number of products on its performance. Extensive numerical experiments validate the efficacy of the proposed algorithms.