Mitigating Pooling Bias in E-commerce Search via False Negative Estimation

Xiaochen Wang, Xiao Xiao, Ruhan Zhang, Xuan Zhang, Taesik Na, Tejaswi Tenneti, Haixun Wang, Fenglong Ma
2024-08-17
Abstract: Efficient and accurate product relevance assessment is critical for user experiences and business success. Training a proficient relevance assessment model requires high-quality query-product pairs, often obtained through negative sampling strategies. Unfortunately, current methods introduce pooling bias by mistakenly sampling false negatives, diminishing performance and business impact. To address this, we present Bias-mitigating Hard Negative Sampling (BHNS), a novel negative sampling strategy tailored to identify and adjust for false negatives, building upon our original False Negative Estimation algorithm. Our experiments in the Instacart search setting confirm BHNS as effective for practical e-commerce use. Furthermore, comparative analyses on a public dataset showcase its domain-agnostic potential for diverse applications.
Information Retrieval, Machine Learning
What problem does this paper attempt to address?
This paper addresses the pooling bias that negative sampling strategies introduce in e-commerce search. Specifically, current methods sample some query-product pairs that are actually relevant (false negatives) and label them as irrelevant, which reduces model performance and business impact. The paper proposes a new negative sampling strategy, Bias-mitigating Hard Negative Sampling (BHNS), which identifies and adjusts for these false negatives to improve relevance assessment models in e-commerce search.

### Main Contributions:

1. **Proposing the BHNS Strategy**: Building on the False Negative Estimation (FNE) algorithm, BHNS adjusts the negative sampling process by estimating the probability that a sampled query-product pair is a false negative.
2. **Experimental Validation**: Experiments in Instacart's real search setting and on public datasets verify the effectiveness and generalization ability of BHNS.
3. **Dual Insurance Mechanism**: Two complementary measures, sampling regularization and pseudo-label generation, further reduce the impact of pooling bias.

### Method Overview:

- **False Negative Estimation (FNE)**: Uses semantic similarity to estimate the probability that a query-product pair is a false negative. The underlying assumption is that if two queries are semantically similar, a given product is likely to have similar relevance to both.
- **Sampling Regularization**: When selecting hard negatives, the FNE score is added as a regularization term so that likely false negatives are less likely to be selected.
- **Pseudo-label Generation**: Generates pseudo-labels for potential false negatives to further reduce pooling bias.

### Experimental Results:

- **Performance on Public Datasets**: Experimental results on the STS benchmark dataset show that BHNS outperforms the baseline methods on multiple evaluation metrics, especially in handling false negatives.
- **Offline Experiments**: On Instacart's own dataset, BHNS also outperformed the baselines, mitigating pooling bias and improving the accuracy of search relevance assessment.

### Conclusion:

The BHNS strategy proposed in the paper addresses the pooling bias introduced by negative sampling in e-commerce search. Through false negative estimation and the dual insurance mechanism, it significantly improves model performance and business outcomes.
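The three ideas in the Method Overview can be illustrated with a minimal sketch. This is not the paper's implementation: the function names, the max-similarity form of the FNE score, the linear penalty `hardness - lam * fne`, the use of the FNE score as a soft pseudo-label, and the toy vectors and product names are all assumptions chosen for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def estimate_false_negative(query_vec, product_id, labeled_queries):
    """Hypothetical FNE score in [0, 1].

    Assumption: the score is the highest similarity between the current query
    and any other query for which `product_id` is a known positive.
    `labeled_queries` is a list of (query_vector, set_of_positive_product_ids).
    """
    sims = [
        cosine(query_vec, other_vec)
        for other_vec, positives in labeled_queries
        if product_id in positives
    ]
    return max(0.0, max(sims, default=0.0))


def select_hard_negative(candidates, hardness, fne, lam=1.0):
    """Sampling regularization (assumed form): maximize hardness minus a
    penalty proportional to the estimated false-negative score."""
    return max(candidates, key=lambda p: hardness[p] - lam * fne[p])


def pseudo_label(fne_score):
    """Assumed pseudo-label: use the FNE score as a soft relevance target
    instead of a hard 0 for the sampled negative."""
    return fne_score


# Toy data (hypothetical): two labeled queries with 2-d embeddings.
labeled_queries = [
    ([1.0, 0.0], {"oat_milk"}),       # e.g. query "oat milk"
    ([0.0, 1.0], {"almond_butter"}),  # e.g. query "almond butter"
]
query_vec = [0.9, 0.1]  # a new query close to the first labeled query
candidates = ["oat_milk", "almond_butter"]

# Hardness stands in for whatever score the base hard-negative sampler uses.
hardness = {"oat_milk": 0.95, "almond_butter": 0.60}
fne = {p: estimate_false_negative(query_vec, p, labeled_queries) for p in candidates}

unregularized = select_hard_negative(candidates, hardness, fne, lam=0.0)
regularized = select_hard_negative(candidates, hardness, fne, lam=1.0)
print("without regularization:", unregularized)
print("with regularization:", regularized)
print("pseudo-label for chosen negative:", pseudo_label(fne[regularized]))
```

In this toy setup the hardest candidate is also the one most likely to be a false negative, because it is a known positive for a very similar query. With `lam=0.0` the sampler picks it anyway; with the penalty enabled it falls back to the other candidate. The weight `lam` is a hypothetical knob for trading hardness against false-negative risk.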