DCMT: A Direct Entire-Space Causal Multi-Task Framework for Post-Click Conversion Estimation.
Feng Zhu, Mingjie Zhong, Xinxing Yang, Longfei Li, Lu Yu, Tiehua Zhang, Jun Zhou, Chaochao Chen, Fei Wu, Guanfeng Liu, Yan Wang
DOI: https://doi.org/10.1109/icde55515.2023.00239
2023-01-01
Abstract: In recommendation scenarios, two long-standing challenges, selection bias and data sparsity, lead to a significant drop in prediction accuracy for both Click-Through Rate (CTR) and post-click Conversion Rate (CVR) tasks. To cope with these issues, existing works either leverage Multi-Task Learning (MTL) frameworks (Category 1) to incorporate more auxiliary data in the entire exposure/inference space $\mathcal{D}$, or use causal debiasing frameworks (Category 2) to correct the selection bias in the click/training space $\mathcal{O}$. However, these two kinds of solutions cannot effectively address the not-missing-at-random problem or correct the selection bias in $\mathcal{O}$ so as to fit inference in $\mathcal{D}$. To fill these research gaps, we propose a Direct entire-space Causal Multi-Task framework, namely DCMT, for post-click conversion prediction. Specifically, inspired by users' decision process of conversion, we propose a new counterfactual mechanism that corrects the selection bias directly in $\mathcal{D}$ by predicting both the factual CVR and the counterfactual CVR under the soft constraint of counterfactual prior knowledge. Extensive experiments demonstrate that DCMT improves on state-of-the-art methods by an average of 1.07% in terms of CVR AUC on the offline datasets and by 0.75% in terms of PV-CVR in an online A/B test (Alipay Search). Such improvements can translate into millions of additional conversions per week in real industrial applications such as Alipay Search.
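The counterfactual mechanism can be illustrated with a minimal sketch. The abstract does not give the loss, so everything below is an assumption made for illustration: the prior is taken to be that the factual and counterfactual CVR predictions should sum to roughly 1, clicked samples are weighted by the inverse of the predicted CTR, non-clicked samples are treated as counterfactual positives weighted by the inverse of one minus the predicted CTR, and the names `dcmt_style_cvr_loss`, `soft_constraint_penalty`, `lam`, and `floor` are hypothetical.

```python
import math


def bce(label, prob, eps=1e-7):
    """Binary cross-entropy for one sample, with clipping for stability."""
    prob = min(max(prob, eps), 1.0 - eps)
    return -(label * math.log(prob) + (1 - label) * math.log(1.0 - prob))


def soft_constraint_penalty(p_cvr, p_cvr_cf):
    """Assumed prior: factual + counterfactual CVR should be close to 1."""
    return abs(1.0 - (p_cvr + p_cvr_cf))


def dcmt_style_cvr_loss(samples, lam=1.0, floor=0.05):
    """Illustrative entire-space CVR loss (not the authors' exact objective).

    Each sample is a dict with keys:
      click, conversion : observed 0/1 labels
      p_ctr             : predicted click propensity
      p_cvr, p_cvr_cf   : factual and counterfactual CVR predictions
    """
    total = 0.0
    for s in samples:
        if s["click"] == 1:
            # Factual term on the click space, inverse-propensity weighted.
            weight = 1.0 / max(s["p_ctr"], floor)
            total += weight * bce(s["conversion"], s["p_cvr"])
        else:
            # Counterfactual term on the non-click space (assumed label 1).
            weight = 1.0 / max(1.0 - s["p_ctr"], floor)
            total += weight * bce(1, s["p_cvr_cf"])
        # Soft constraint applied to every exposed sample.
        total += lam * soft_constraint_penalty(s["p_cvr"], s["p_cvr_cf"])
    return total / len(samples)


consistent = [{"click": 1, "conversion": 1, "p_ctr": 0.5,
               "p_cvr": 0.8, "p_cvr_cf": 0.2}]
inconsistent = [{"click": 1, "conversion": 1, "p_ctr": 0.5,
                 "p_cvr": 0.8, "p_cvr_cf": 0.9}]
loss_consistent = dcmt_style_cvr_loss(consistent)
loss_inconsistent = dcmt_style_cvr_loss(inconsistent)
```

In this sketch the two samples share the same factual prediction, so `loss_inconsistent` exceeds `loss_consistent` only because the second pair of predictions violates the assumed sum-to-one prior. The `floor` on the propensity is a common way to keep inverse-propensity weights bounded and is likewise an assumption here.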