Abstract:Test-time domain adaptation aims to adapt the model trained on source domains to unseen target domains using a few unlabeled images. Emerging research has shown that the label and domain information is separately embedded in the weight matrix and batch normalization (BN) layer. Previous works normally update the whole network naively without explicitly decoupling the knowledge between label and domain. As a result, it leads to knowledge interference and defective distribution adaptation. In this work, we propose to reduce such learning interference and elevate the domain knowledge learning by only manipulating the BN layer. However, the normalization step in BN is intrinsically unstable when the statistics are re-estimated from a few samples. We find that ambiguities can be greatly reduced when only updating the two affine parameters in BN while keeping the source domain statistics. To further enhance the domain knowledge extraction from unlabeled data, we construct an auxiliary branch with label-independent self-supervised learning (SSL) to provide supervision. Moreover, we propose a bi-level optimization based on meta-learning to enforce the alignment of two learning objectives of auxiliary and main branches. The goal is to use the auxiliary branch to adapt the domain and benefit main task for subsequent inference. Our method keeps the same computational cost at inference as the auxiliary branch can be thoroughly discarded after adaptation. Extensive experiments show that our method outperforms the prior works on five WILDS real-world domain shift datasets. Our method can also be integrated with methods with label-dependent optimization to further push the performance boundary. Our code is available at <a class="link-external link-https" href="https://github.com/ynanwu/MABN" rel="external noopener nofollow">this https URL</a>.

Towards Test Time Domain Adaptation Via Negative Label Smoothing

Learning label smoothing for text classification

Improving Time Series Classification with Representation Soft Label Smoothing

Towards Understanding Why Label Smoothing Degrades Selective Classification and How to Fix It

Rethinking Precision of Pseudo Label: Test-Time Adaptation Via Complementary Learning

Less is More: Pseudo-Label Filtering for Continual Test-Time Adaptation

Test-time Adaptation for Regression by Subspace Alignment

Confidence-based and sample-reweighted test-time adaptation

Domain-Specific Block Selection and Paired-View Pseudo-Labeling for Online Test-Time Adaptation

Channel-Selective Normalization for Label-Shift Robust Test-Time Adaptation

Exploring Test-Time Adaptation for Object Detection in Continually Changing Environments

Feature Alignment and Uniformity for Test Time Adaptation

Beyond Invariance: Test-Time Label-Shift Adaptation for Distributions with "Spurious" Correlations

Test-Time Model Adaptation for Visual Question Answering with Debiased Self-Supervisions

Improved Test-Time Adaptation for Domain Generalization

Test-Time Domain Adaptation by Learning Domain-Aware Batch Normalization

Test-time adaptation for geospatial point cloud semantic segmentation with distinct domain shifts

PALM: Pushing Adaptive Learning Rate Mechanisms for Continual Test-Time Adaptation

Robust Long-Tailed Learning under Label Noise

Test-time adaptation for image compression with distribution regularization

ETAGE: Enhanced Test Time Adaptation with Integrated Entropy and Gradient Norms for Robust Model Performance