Identify Then Recommend: Towards Unsupervised Group Recommendation

Yue Liu,Shihao Zhu,Tianyuan Yang,Jian Ma,Wenliang Zhong
2024-10-31
Abstract:Group Recommendation (GR), which aims to recommend items to groups of users, has become a promising and practical direction for recommendation systems. This paper points out two issues of the state-of-the-art GR models. (1) The pre-defined and fixed number of user groups is inadequate for real-time industrial recommendation systems, where the group distribution can shift dynamically. (2) The training schema of existing GR methods is supervised, necessitating expensive user-group and group-item labels, leading to significant annotation costs. To this end, we present a novel unsupervised group recommendation framework named \underline{I}dentify \underline{T}hen \underline{R}ecommend (\underline{ITR}), where it first identifies the user groups in an unsupervised manner even without the pre-defined number of groups, and then two pre-text tasks are designed to conduct self-supervised group recommendation. Concretely, at the group identification stage, we first estimate the adaptive density of each user point, where areas with higher densities are more likely to be recognized as group centers. Then, a heuristic merge-and-split strategy is designed to discover the user groups and decision boundaries. Subsequently, at the self-supervised learning stage, the pull-and-repulsion pre-text task is proposed to optimize the user-group distribution. Besides, the pseudo group recommendation pre-text task is designed to assist the recommendations. Extensive experiments demonstrate the superiority and effectiveness of ITR on both user recommendation (e.g., 22.22\% NDCG@5 $\uparrow$) and group recommendation (e.g., 22.95\% NDCG@5 $\uparrow$). Furthermore, we deploy ITR on the industrial recommender and achieve promising results.
Information Retrieval
What problem does this paper attempt to address?
This paper attempts to solve two main problems: 1. **The problem of the number of dynamic user groups**: - Existing state - of - the - art group recommendation (GR) models rely on a predefined and fixed number of user groups, which is not sufficient in real - time industrial recommendation systems because the group distribution may change dynamically. In practical applications, the number of user groups is usually unknown and changes dynamically. 2. **The cost problem of supervised training schemes**: - Existing GR methods adopt supervised training schemes and require a large number of user - group and group - item labels, which leads to high labeling costs. The acquisition of these labels is not only expensive but also difficult to achieve in large - scale datasets. To solve these problems, the authors propose a new unsupervised group recommendation framework named "Identify Then Recommend (ITR)". The main features of this framework are as follows: - **No need to pre - define the number of groups**: ITR can automatically identify user groups without pre - defining the number of user groups. - **Self - supervised learning**: By designing two pre - training tasks (the pull - and - repulsion pre - training task and the pseudo - group - recommendation pre - training task), ITR can perform effective group recommendations without labels. Specifically, ITR is divided into two modules: 1. **Group Identification Module**: - Automatically identify user groups through adaptive density estimation and heuristic merge - split strategies. First, estimate the density of each user and adjust the group center dynamically according to the density. Then optimize the group structure by merging similar users and splitting different user groups. 2. **Self - supervised Group Recommendation Module**: - Optimize group embeddings and user embeddings through two pre - training tasks. The pull - and - repulsion pre - training task aims to pull users towards their own groups and push away different groups; the pseudo - group - recommendation pre - training task generates pseudo - labels to guide the model in group recommendation. Finally, the ITR model has demonstrated its superiority and effectiveness in multiple experiments, especially achieving significant performance improvements in user recommendation and group recommendation tasks. In addition, ITR has been successfully deployed in large - scale industrial recommendation systems and has achieved satisfactory results. In summary, this paper solves the limitations of existing group recommendation methods in terms of dynamic group numbers and supervised training costs by proposing the ITR framework, providing a more suitable solution for practical application scenarios.