Sequential Advertising Agent with Interpretable User Hidden Intents

Zhaoqing Peng,Junqi Jin,Lan Luo,Yaodong Yang,Rui Luo,Jun Wang,Weinan Zhang,Miao Xu,Chuan Yu,Tiejian Luo,Han Li,Jian Xu,Kun Gai
DOI: https://doi.org/10.5555/3398761.3399043
2020-01-01
Abstract:Online advertising campaigns are typically launched for a customer across multiple touch points (scenarios) before the conversion of his final purchase. To maximize the advertisers' revenue, it requires the platform to develop its advertising strategy based on the consumers' behavioral trajectories in the previous scenarios. Meanwhile, it is also critical to maintain the interpretability of the models on the conversion rate; however, modern reinforcement learning based solutions fail to do so due to their black-box modeling on the consumer intents. In this paper, we model consumer's purchase intention as a latent variable, and formulate the advertising problem as a partially observed Markov Decision Process (POMDP). We incorporate the expectation maximization (EM) algorithms for solving the optimal POMDP. Our extensive experiments based on large-scale real-world data demonstrate that our method provides superior performance over several baselines. Apart from the improved advertising performance, our method is able to offer interpretation on the attribution of the conversion.
What problem does this paper attempt to address?