Abstract:Tabular data plays a vital role in various real-world scenarios and finds extensive applications. Although recent deep tabular models have shown remarkable success, they still struggle to handle data distribution shifts, leading to performance degradation when testing distributions change. To remedy this, a robust tabular model must adapt to generalize to unknown distributions during testing. In this paper, we investigate the problem of fully test-time adaptation (FTTA) for tabular data, where the model is adapted using only the testing data. We identify three key challenges: the existence of label and covariate distribution shifts, the lack of effective data augmentation, and the sensitivity of adaptation, which render existing FTTA methods ineffective for tabular data. To this end, we propose the Fully Test-time Adaptation for Tabular data, namely FTAT, which enables FTTA methods to robustly optimize the label distribution of predictions, adapt to shifted covariate distributions, and suit a variety of tasks and models effectively. We conduct comprehensive experiments on six benchmark datasets, which are evaluated using three metrics. The experimental results demonstrate that FTAT outperforms state-of-the-art methods by a margin.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is the performance degradation of existing deep - learning models in tabular data when the distribution of test data changes. Specifically, the authors focus on the Fully Test - time Adaptation (FTTA) problem, that is, using only test data to adjust the pre - trained deep tabular model without source training data, so that it can better generalize to unknown data distributions. Here are several key points in the paper: 1. **Existing challenges**: - **Label and covariate distribution shift**: There are changes in the label distribution and covariate distribution in tabular data, which lead to the ineffectiveness of existing FTTA methods. - **Lack of effective data augmentation**: Traditional test - time adaptation methods rely on data augmentation, but for tabular data, these methods have limited effectiveness. - **Sensitivity to tasks and models**: The task and model selection of tabular data are very sensitive to the adaptation process, and optimization needs to be carried out for specific tasks and models. 2. **Proposed method**: To address the above challenges, the authors propose a new FTTA method - F TAT (Fully Test - time Adaptation for Tabular data), which consists of three modules: - **Confident Distribution Optimizer**: Estimate and optimize the label distribution through low - entropy prediction data. - **Local Consistent Weighter**: Filter low - quality predictions by calculating the prediction consistency between data points and their neighborhoods, and ensure robust test - time adaptation. - **Dynamic Model Ensembler**: Integrate multiple models with different learning rates online to obtain more robust predictions. 3. **Experimental results**: The authors conducted extensive experiments on six benchmark datasets to evaluate the performance of the F TAT method relative to other existing FTTA methods. The experimental results show that F TAT significantly outperforms other methods in all metrics, especially in dealing with label and covariate distribution shift. ### Summary The main contribution of this paper lies in identifying the three major challenges faced by FTTA in tabular data and proposing a new method, F TAT, which uses three specially designed modules to address these challenges. The experimental results show that F TAT has superior performance on multiple tasks and models, providing an effective solution to the distribution shift problem in tabular data.

Fully Test-time Adaptation for Tabular Data

AdapTable: Test-Time Adaptation for Tabular Data via Shift-Aware Uncertainty Calibrator and Label Distribution Handler

A Comprehensive Survey on Test-Time Adaptation under Distribution Shifts

On Pitfalls of Test-Time Adaptation

Distribution Alignment for Fully Test-Time Adaptation with Dynamic Online Data Streams

Feature Augmentation based Test-Time Adaptation

UniTTA: Unified Benchmark and Versatile Framework Towards Realistic Test-Time Adaptation

Reliable Test-Time Adaptation via Agreement-on-the-Line

Robust Test-Time Adaptation in Dynamic Scenarios

DATTA: Towards Diversity Adaptive Test-Time Adaptation in Dynamic Wild World

Universal Test-time Adaptation through Weight Ensembling, Diversity Weighting, and Prior Correction

Bag of Tricks for Fully Test-Time Adaptation

Active Test-Time Adaptation: Theoretical Analyses and An Algorithm

Label Shift Adapter for Test-Time Adaptation under Covariate and Label Shifts

Improved Test-Time Adaptation for Domain Generalization

MITA: Bridging the Gap between Model and Data for Test-time Adaptation

Adaptive Test-Time Personalization for Federated Learning

Learning to Adapt to Online Streams with Distribution Shifts

AR-TTA: A Simple Method for Real-World Continual Test-Time Adaptation

Test-time Adaptation in the Dynamic World with Compound Domain Knowledge Management

Evaluation of Test-Time Adaptation Under Computational Time Constraints