Modeling User Retention through Generative Flow Networks

Ziru Liu,Shuchang Liu,Bin Yang,Zhenghai Xue,Qingpeng Cai,Xiangyu Zhao,Zijian Zhang,Lantao Hu,Han Li,Peng Jiang
DOI: https://doi.org/10.1145/3637528.3671531
2024-06-10
Abstract:Recommender systems aim to fulfill the user's daily demands. While most existing research focuses on maximizing the user's engagement with the system, it has recently been pointed out that how frequently the users come back for the service also reflects the quality and stability of recommendations. However, optimizing this user retention behavior is non-trivial and poses several challenges including the intractable leave-and-return user activities, the sparse and delayed signal, and the uncertain relations between users' retention and their immediate feedback towards each item in the recommendation list. In this work, we regard the retention signal as an overall estimation of the user's end-of-session satisfaction and propose to estimate this signal through a probabilistic flow. This flow-based modeling technique can back-propagate the retention reward towards each recommended item in the user session, and we show that the flow combined with traditional learning-to-rank objectives eventually optimizes a non-discounted cumulative reward for both immediate user feedback and user retention. We verify the effectiveness of our method through both offline empirical studies on two public datasets and online A/B tests in an industrial platform.
Information Retrieval
What problem does this paper attempt to address?
The paper aims to address the issue of user retention optimization in recommendation systems. Although most existing research focuses on maximizing user interaction with the system (such as click-through rate, likes, etc.), recent studies have pointed out that the frequency of users returning to use the service also reflects the quality and stability of recommendations. However, optimizing this user retention behavior presents several challenges, including the difficulty of tracking user departure and return behavior, sparse and delayed signals, and the uncertain relationship between user retention and their immediate feedback on each item in the recommendation list. To tackle these challenges, the authors propose a method based on Generative Flow Networks (GFNs) — GFN4Retention. This method views retention signals as an overall estimate of user satisfaction at the end of a session and models this signal through probabilistic flow. This approach can backpropagate retention rewards to each recommended item in the user session and demonstrates that combining traditional learning-to-rank objectives can ultimately optimize both immediate user feedback and the non-discounted cumulative reward of user retention. Additionally, the paper validates the effectiveness of this method through offline empirical studies on 2 public datasets and online A/B tests on an industrial platform.