One Backpropagation in Two Tower Recommendation Models

Erjia Chen,Bang Wang
2024-10-08
Abstract:Recent years have witnessed extensive researches on developing two tower recommendation models for relieving information overload. Four building modules can be identified in such models, namely, user-item encoding, negative sampling, loss computing and back-propagation updating. To the best of our knowledge, existing algorithms have researched only on the first three modules, yet neglecting the backpropagation module. They all adopt a kind of two backpropagation strategy, which are based on an implicit assumption of equally treating users and items in the training phase. In this paper, we challenge such an equal training assumption and propose a novel one backpropagation updating strategy, which keeps the normal gradient backpropagation for the item encoding tower, but cuts off the backpropagation for the user encoding tower. Instead, we propose a moving-aggregation updating strategy to update a user encoding in each training epoch. Except the proposed backpropagation updating module, we implement the other three modules with the most straightforward choices. Experiments on four public datasets validate the effectiveness and efficiency of our model in terms of improved recommendation performance and reduced computation overload over the state-of-the-art competitors.
Information Retrieval
What problem does this paper attempt to address?
The problem that this paper attempts to solve is that in the two - tower recommendation model, the existing training strategies treat users and items equally in the back - propagation update stage, that is, simultaneously perform gradient back - propagation updates on the representations of users and items. This approach is based on an implicit assumption that users and items should be treated equally during the training stage. However, the author believes that this assumption may not be suitable for solving the One - Class Collaborative Filtering (OCCF) problem. Specifically, since users' interests may be diverse and these interests may be distributed among different latent types, if the traditional double - back - propagation strategy is used, it may cause the user representation to be overly biased towards those items of the same type while ignoring items of other types, thus affecting the recommendation effect. To solve this problem, the paper proposes a new training strategy - One - time Back - Propagation (OneBP). Its core idea is to maintain the normal gradient back - propagation of the item - encoding tower, but cut off the gradient back - propagation of the user - encoding tower. Instead, the author proposes a moving - aggregation strategy to update the user's representation. Specifically, in each training cycle, the user's representation is updated by aggregating its previous representation and the latest representation of the currently interacting item. This strategy aims to better capture users' diverse interests while reducing the computational burden. Through experiments on four public datasets, the paper verifies the superiority of the proposed OneBP model in terms of recommendation performance and computational efficiency. The experimental results show that compared with the existing state - of - the - art methods, OneBP not only improves the recommendation performance but also shows significant advantages in computational time.