Abstract: Test-time adaptation (TTA) seeks to tackle potential distribution shifts between training and test data by adapting a given model w.r.t. any test sample. Although recent TTA methods have shown promising performance, two key challenges remain: 1) prior methods perform backpropagation for each test sample, resulting in prohibitive optimization costs for many applications; 2) while existing TTA methods can significantly improve test performance on out-of-distribution data, they often suffer from severe performance degradation on in-distribution data after adaptation (known as forgetting). To this end, we have proposed an Efficient Anti-Forgetting Test-Time Adaptation (EATA) method, which develops an active sample selection criterion to identify reliable and non-redundant samples for test-time entropy minimization. To alleviate forgetting, EATA introduces a Fisher regularizer estimated from test samples to prevent important model parameters from changing drastically. However, the entropy loss adopted in EATA consistently assigns higher confidence to predictions, even for samples that are inherently uncertain, leading to overconfident predictions. To tackle this, we further propose EATA with Calibration (EATA-C), which separately exploits the reducible model uncertainty and the inherent data uncertainty for calibrated TTA. Specifically, we measure the model uncertainty by the divergence between predictions from the full network and its sub-networks, and on this basis we propose a divergence loss that encourages consistent predictions instead of overconfident ones. To further recalibrate prediction confidence, we use the disagreement among predicted labels as an indicator of the data uncertainty, and then devise a min-max entropy regularizer that selectively increases or decreases prediction confidence for different samples. Experiments on image classification and semantic segmentation verify the effectiveness of our methods.
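The two EATA ingredients named in the abstract, entropy minimization restricted to reliable samples and a Fisher-weighted penalty on parameter drift, can be illustrated with a minimal sketch. It is not the authors' implementation. The function names, the fixed entropy threshold `max_entropy`, the weight `reg_strength`, and the assumption that the Fisher importances are already available as a list are all hypothetical choices made for illustration. The non-redundancy check and the EATA-C calibration terms are omitted.

```python
import math

def softmax(logits):
    # Numerically stable softmax over one sample's logits.
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def entropy(probs):
    # Shannon entropy (in nats) of a probability vector.
    return -sum(p * math.log(p) for p in probs if p > 0.0)

def select_reliable(batch_logits, max_entropy):
    # Keep indices of samples whose prediction entropy is below the
    # (hypothetical) threshold; high-entropy samples are skipped.
    return [i for i, logits in enumerate(batch_logits)
            if entropy(softmax(logits)) < max_entropy]

def fisher_penalty(params, anchor_params, fisher):
    # Importance-weighted squared distance from the original parameters.
    # `fisher` is assumed to be a precomputed per-parameter importance.
    return sum(f * (p - a) ** 2
               for p, a, f in zip(params, anchor_params, fisher))

def adaptation_loss(batch_logits, params, anchor_params, fisher,
                    max_entropy, reg_strength):
    # Mean entropy over the selected samples plus the drift penalty.
    keep = select_reliable(batch_logits, max_entropy)
    if keep:
        ent = sum(entropy(softmax(batch_logits[i])) for i in keep) / len(keep)
    else:
        ent = 0.0  # no reliable sample in this batch: only the penalty remains
    penalty = fisher_penalty(params, anchor_params, fisher)
    return ent + reg_strength * penalty, keep
```

In a real setting the loss would be computed with an automatic-differentiation framework so that only the selected samples trigger backpropagation, which is where the efficiency gain described in the abstract would come from.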
