Abstract: Deep learning has been widely applied in recommender systems and has achieved revolutionary progress recently. However, most existing learning-based methods assume that the user and item distributions remain unchanged between the training phase and the test phase. In real-world scenarios, the distribution of user and item features can naturally shift, potentially resulting in a substantial decrease in recommendation performance. This phenomenon can be formulated as an Out-Of-Distribution (OOD) recommendation problem. To address this challenge, we propose a novel Dual Test-Time-Training framework for OOD Recommendation, termed DT3OR. In DT3OR, we incorporate a model adaptation mechanism during the test-time phase to carefully update the recommendation model, allowing it to adapt specifically to the shifting user and item features. To be specific, we propose a self-distillation task and a contrastive task to help the model learn both the user's invariant interest preferences and the variant user/item characteristics during the test-time phase, thus facilitating a smooth adaptation to the shifting features. Furthermore, we provide theoretical analysis to support the rationale behind our dual test-time training framework. To the best of our knowledge, this paper is the first work to address OOD recommendation via a test-time-training strategy. We conduct experiments on three datasets with various backbones. Comprehensive experimental results demonstrate the effectiveness of DT3OR compared to other state-of-the-art baselines.

What problem does this paper attempt to address?

This paper primarily addresses a specific issue in recommendation systems: Out-Of-Distribution (OOD) recommendation. Specifically, the paper focuses on how to improve the performance of recommendation systems when the feature distribution of users and items changes between the training and testing phases.

### Research Background

Most existing learning-based methods assume that the distribution of users and items remains unchanged from the training phase to the testing phase (i.e., Independent and Identically Distributed, IID). However, in the real world, these distributions often change, leading to a decline in recommendation performance. This phenomenon is known as the OOD recommendation problem.

### Solution

To address the OOD recommendation problem, the authors propose a new framework called DT3OR (Dual Test-Time-Training for Out-of-distribution Recommender System). The core idea is to use self-supervised learning tasks during the testing phase to update the recommendation model, enabling it to adapt to changes in user and item features.

#### Main Contributions

1. **DT3OR** is the first work to address the OOD recommendation problem through test-time training.
2. A dual test-time training strategy is proposed, which includes two self-supervised learning tasks: a self-distillation task and a contrastive task, aimed at learning invariant user preference relationships and adapting the model to changing user and item features.
3. The proposed dual test-time training strategy is theoretically analyzed, proving its rationality.
4. Experimental results on three datasets show that DT3OR outperforms other state-of-the-art recommendation algorithms in OOD scenarios.

### Method Overview

- **Self-distillation task**: Aims to minimize the distance between the interest centers of users with the same preferences, enhancing the uniformity of user interest representations in the latent space.
- **Contrastive task**: Establishes meaningful inter-user relationships by selecting high-confidence samples from the same user interest center, thereby capturing associations between users with similar preferences.

### Conclusion

This paper proposes the DT3OR framework, which effectively addresses the OOD problem in recommendation systems. Theoretical analysis and experiments demonstrate its effectiveness. This method is significant for improving the robustness and generalization ability of recommendation systems when facing changes in user and item features.
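
As a rough illustration of the two test-time tasks named in the method overview, the sketch below computes one possible form of each loss on toy user embeddings. It is not the authors' implementation: the function names, the squared-distance pull toward an interest center, the `keep_ratio` rule for picking high-confidence users, and the InfoNCE-style contrastive form with a `temperature` are all assumptions made for illustration. Only the loss values are computed; the gradient step that would update the recommendation model at test time is omitted.

```python
import math

# All names and formulas below are hypothetical illustrations, not DT3OR's code.

def sq_dist(a, b):
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def cosine(a, b):
    """Cosine similarity between two non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b)))

def assign_to_centers(users, centers):
    """Index of the nearest interest center for each user embedding."""
    return [min(range(len(centers)), key=lambda k: sq_dist(u, centers[k]))
            for u in users]

def self_distillation_loss(users, centers, labels):
    """Assumed form: mean squared distance from each user to its assigned center."""
    return sum(sq_dist(u, centers[c]) for u, c in zip(users, labels)) / len(users)

def high_confidence(users, centers, labels, keep_ratio=0.5):
    """Per center, keep the users closest to it (assumed confidence rule)."""
    kept = []
    for k in range(len(centers)):
        members = sorted((i for i, c in enumerate(labels) if c == k),
                         key=lambda i: sq_dist(users[i], centers[k]))
        kept.extend(members[:math.ceil(keep_ratio * len(members))])
    return sorted(kept)

def contrastive_loss(users, labels, kept, temperature=0.5):
    """InfoNCE-style loss: kept users sharing a center are positives, the rest negatives."""
    total, count = 0.0, 0
    for i in kept:
        others = [j for j in kept if j != i]
        positives = [j for j in others if labels[j] == labels[i]]
        if not positives:
            continue  # no same-center partner among the kept users
        denom = sum(math.exp(cosine(users[i], users[j]) / temperature) for j in others)
        for j in positives:
            numer = math.exp(cosine(users[i], users[j]) / temperature)
            total += -math.log(numer / denom)
            count += 1
    return total / count if count else 0.0

# Toy data: six 2-d user embeddings forming two interest groups.
users = [[1.0, 0.1], [0.9, 0.2], [1.1, 0.0],
         [0.1, 1.0], [0.2, 0.9], [0.0, 1.1]]
centers = [[1.0, 0.1], [0.1, 1.0]]

labels = assign_to_centers(users, centers)
kept = high_confidence(users, centers, labels)
l_sd = self_distillation_loss(users, centers, labels)
l_con = contrastive_loss(users, labels, kept)
print("labels:", labels)
print("self-distillation loss:", l_sd)
print("contrastive loss:", l_con)
```

In a full test-time-training loop, a weighted sum of the two losses would presumably be minimized with respect to the model parameters on unlabeled test data, with the interest centers recomputed as the embeddings move; the weighting and update schedule are not specified in this summary.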

Dual Test-time Training for Out-of-distribution Recommender System
