Dual Test-time Training for Out-of-distribution Recommender System

Xihong Yang, Yiqi Wang, Jin Chen, Wenqi Fan, Xiangyu Zhao, En Zhu, Xinwang Liu, Defu Lian
2024-07-22
Abstract: Deep learning has been widely applied in recommender systems and has achieved revolutionary progress recently. However, most existing learning-based methods assume that the user and item distributions remain unchanged between the training phase and the test phase. In real-world scenarios, the distribution of user and item features can naturally shift, potentially resulting in a substantial decrease in recommendation performance. This phenomenon can be formulated as an Out-Of-Distribution (OOD) recommendation problem. To address this challenge, we propose a novel Dual Test-Time-Training framework for OOD Recommendation, termed DT3OR. In DT3OR, we incorporate a model adaptation mechanism during the test-time phase to carefully update the recommendation model, allowing the model to adapt specifically to the shifting user and item features. To be specific, we propose a self-distillation task and a contrastive task to assist the model in learning both the user's invariant interest preferences and the variant user/item characteristics during the test-time phase, thus facilitating a smooth adaptation to the shifting features. Furthermore, we provide theoretical analysis to support the rationale behind our dual test-time training framework. To the best of our knowledge, this paper is the first work to address OOD recommendation via a test-time-training strategy. We conduct experiments on three datasets with various backbones. Comprehensive experimental results demonstrate the effectiveness of DT3OR compared to other state-of-the-art baselines.
Information Retrieval, Machine Learning
What problem does this paper attempt to address?
This paper addresses a specific issue in recommendation systems: Out-Of-Distribution (OOD) recommendation. Specifically, it focuses on how to improve the performance of recommendation systems when the feature distribution of users and items changes between the training and testing phases.

### Research Background

Most existing learning-based methods assume that the distribution of users and items remains unchanged from the training phase to the testing phase (i.e., that the data are Independent and Identically Distributed, IID). However, in the real world, these distributions often change, leading to a decline in recommendation performance. This phenomenon is known as the OOD recommendation problem.

### Solution

To address the OOD recommendation problem, the authors propose a new framework called DT3OR (Dual Test-Time-Training for Out-of-distribution Recommender System). The core idea is to use self-supervised learning tasks during the testing phase to update the recommendation model, enabling it to adapt to changes in user and item features.

#### Main Contributions

1. **DT3OR** is, to the best of the authors' knowledge, the first work to address the OOD recommendation problem through test-time training.
2. A dual test-time training strategy is proposed, which includes two self-supervised learning tasks: a self-distillation task and a contrastive task. Together they aim to learn invariant user preference relationships and to adapt the model to changing user and item features.
3. A theoretical analysis of the proposed dual test-time training strategy is provided to support its rationale.
4. Experimental results on three datasets show that DT3OR outperforms other state-of-the-art recommendation algorithms in OOD scenarios.

### Method Overview

- **Self-distillation task**: Aims to minimize the distance between the interest centers of users with the same preferences, enhancing the uniformity of user interest representations in the latent space.
- **Contrastive task**: Establishes meaningful inter-user relationships by selecting high-confidence samples from the same user interest center, thereby capturing associations between users with similar preferences.

### Conclusion

This paper proposes the DT3OR framework to address the OOD problem in recommendation systems. Theoretical analysis and experiments support its effectiveness. The method is significant for improving the robustness and generalization ability of recommendation systems when user and item features change.
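
The two self-supervised tasks from the method overview can be illustrated with a minimal NumPy sketch. It is not the authors' implementation. Several choices here are assumptions made only for illustration: using k-means to find interest centers, treating distance to the assigned center as the confidence measure, the InfoNCE-style form of the contrastive loss, and the names `keep_ratio`, `tau`, and `lam`. The sketch only evaluates the losses on synthetic user embeddings; in an actual test-time training loop the combined loss would presumably be backpropagated through the recommendation model with an autodiff framework.

```python
import numpy as np

def kmeans(x, k, iters=20):
    """Plain k-means with farthest-point initialisation; returns (centers, assign)."""
    centers = [x[0]]
    for _ in range(1, k):
        # squared distance of every point to its nearest already-chosen center
        d = np.min([((x - c) ** 2).sum(axis=1) for c in centers], axis=0)
        centers.append(x[d.argmax()])
    centers = np.array(centers, dtype=float)
    for _ in range(iters):
        dist = ((x[:, None, :] - centers[None, :, :]) ** 2).sum(axis=-1)
        assign = dist.argmin(axis=1)
        for j in range(k):
            if np.any(assign == j):
                centers[j] = x[assign == j].mean(axis=0)
    dist = ((x[:, None, :] - centers[None, :, :]) ** 2).sum(axis=-1)
    return centers, dist.argmin(axis=1)

def self_distillation_loss(x, centers, assign):
    """Mean squared distance between each user and its assigned interest center."""
    return float(((x - centers[assign]) ** 2).sum(axis=-1).mean())

def contrastive_loss(x, centers, assign, keep_ratio=0.5, tau=0.5):
    """InfoNCE-style loss over high-confidence users; same-center users are positives."""
    dist = np.linalg.norm(x - centers[assign], axis=1)
    # assumption: users closest to their center count as "high-confidence"
    n_keep = max(2, int(len(x) * keep_ratio))
    keep = np.argsort(dist)[:n_keep]
    z = x[keep] / np.linalg.norm(x[keep], axis=1, keepdims=True)
    labels = assign[keep]
    sim = z @ z.T / tau
    np.fill_diagonal(sim, -np.inf)  # exclude self-pairs from the softmax
    log_prob = sim - np.log(np.exp(sim).sum(axis=1, keepdims=True))
    pos = labels[:, None] == labels[None, :]
    np.fill_diagonal(pos, False)
    has_pos = pos.any(axis=1)
    if not has_pos.any():
        return 0.0
    # average log-probability of the positives for each anchor that has any
    per_anchor = np.where(pos, log_prob, 0.0).sum(axis=1)[has_pos] / pos.sum(axis=1)[has_pos]
    return float(-per_anchor.mean())

# Synthetic data: two hypothetical interest groups of 20 users each,
# with a constant offset added to mimic shifted test-time features.
rng = np.random.default_rng(0)
group_a = rng.normal(1.0, 0.1, size=(20, 4))
group_b = rng.normal(-1.0, 0.1, size=(20, 4))
user_emb = np.concatenate([group_a, group_b]) + 0.3

centers, assign = kmeans(user_emb, k=2)
sd_loss = self_distillation_loss(user_emb, centers, assign)
con_loss = contrastive_loss(user_emb, centers, assign)

lam = 1.0  # hypothetical weight balancing the two tasks
total = sd_loss + lam * con_loss
print("self-distillation:", sd_loss, "contrastive:", con_loss, "total:", total)
```

Pulling each embedding toward its assigned center lowers the self-distillation term, which is the sense in which that task tightens user representations around interest centers. The contrastive term then rewards high-confidence users that share a center for being more similar to each other than to users from other centers.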