Unbiased Counterfactual Estimation of Ranking Metrics

Haining Yu
2021-01-01
Abstract:We propose a novel method to estimate metrics for a ranking policy, based on behavioral signal data (e.g. clicks or viewing of video contents) generated by a second different policy. Building on [1], we prove the counterfactual estimator is unbiased, and discuss its low-variance property. The estimator can be used to evaluate ranking model performance offline, to validate and selection positional bias models, and to serve as learning objectives when training new models.
What problem does this paper attempt to address?