Adaptive Hardness Negative Sampling for Collaborative Filtering

Riwei Lai,Rui Chen,Qilong Han,Chi Zhang,Li Chen
2024-01-10
Abstract:Negative sampling is essential for implicit collaborative filtering to provide proper negative training signals so as to achieve desirable performance. We experimentally unveil a common limitation of all existing negative sampling methods that they can only select negative samples of a fixed hardness level, leading to the false positive problem (FPP) and false negative problem (FNP). We then propose a new paradigm called adaptive hardness negative sampling (AHNS) and discuss its three key criteria. By adaptively selecting negative samples with appropriate hardnesses during the training process, AHNS can well mitigate the impacts of FPP and FNP. Next, we present a concrete instantiation of AHNS called AHNS_{p<0}, and theoretically demonstrate that AHNS_{p<0} can fit the three criteria of AHNS well and achieve a larger lower bound of normalized discounted cumulative gain. Besides, we note that existing negative sampling methods can be regarded as more relaxed cases of AHNS. Finally, we conduct comprehensive experiments, and the results show that AHNS_{p<0} can consistently and substantially outperform several state-of-the-art competitors on multiple datasets.
Information Retrieval
What problem does this paper attempt to address?
The problem that this paper attempts to solve is that in Implicit Collaborative Filtering (IC), the existing negative sampling methods can only select negative samples of a fixed difficulty level, which leads to the False Positive Problem (FPP) and the False Negative Problem (FNP). Specifically: 1. **False Positive Problem (FPP)**: During the training process, if only easy negative samples can be selected, then uninteresting items that are not fully updated due to their high initial prediction scores may be recommended to users, resulting in sub - optimal recommendation results. 2. **False Negative Problem (FNP)**: If only fixed - difficulty - level hard negative samples can be selected during the training process, then interesting items that have not been interacted with may be selected as negative samples and ranked low in the recommendation list, thus worsening the recommendation results. To solve these problems, the author proposes a new negative sampling paradigm - Adaptive Hardness Negative Sampling (AHNS), which adaptively selects negative samples of different difficulty levels during the training process to alleviate the effects of FPP and FNP, thereby obtaining better recommendation results.