Abstract:Data organized in tabular format is ubiquitous in real-world applications, and users often craft tables with biased feature definitions and flexibly set prediction targets of their interests. Thus, a rapid development of a robust, effective, dataset-versatile, user-friendly tabular prediction approach is highly desired. While Gradient Boosting Decision Trees (GBDTs) and existing deep neural networks (DNNs) have been extensively utilized by professional users, they present several challenges for casual users, particularly: (i) the dilemma of model selection due to their different dataset preferences, and (ii) the need for heavy hyperparameter searching, failing which their performances are deemed inadequate. In this paper, we delve into this question: Can we develop a deep learning model that serves as a sure bet solution for a wide range of tabular prediction tasks, while also being user-friendly for casual users? We delve into three key drawbacks of deep tabular models, encompassing: (P1) lack of rotational variance property, (P2) large data demand, and (P3) over-smooth solution. We propose ExcelFormer, addressing these challenges through a semi-permeable attention module that effectively constrains the influence of less informative features to break the DNNs' rotational invariance property (for P1), data augmentation approaches tailored for tabular data (for P2), and attentive feedforward network to boost the model fitting capability (for P3). These designs collectively make ExcelFormer a sure bet solution for diverse tabular datasets. Extensive and stratified experiments conducted on real-world datasets demonstrate that our model outperforms previous approaches across diverse tabular data prediction tasks, and this framework can be friendly to casual users, offering ease of use without the heavy hyperparameter tuning. The codes are available at https://github.com/whatashot/excelformer.

Transfer Learning with Deep Tabular Models

A Survey on Deep Tabular Learning

Making Pre-trained Language Models Great on Tabular Prediction

Large Scale Transfer Learning for Tabular Data via Language Modeling

Revisiting Deep Learning Models for Tabular Data

Deep Learning with Tabular Data: A Self-supervised Approach

CARTE: Pretraining and Transfer for Tabular Learning

TabReD: Analyzing Pitfalls and Filling the Gaps in Tabular Deep Learning Benchmarks

Simple Modifications to Improve Tabular Neural Networks

Tabular Transformers for Modeling Multivariate Time Series

TabGSL: Graph Structure Learning for Tabular Data Prediction

PTab: Using the Pre-trained Language Model for Modeling Tabular Data

From Supervised to Generative: A Novel Paradigm for Tabular Deep Learning with Large Language Models

TabR: Tabular Deep Learning Meets Nearest Neighbors in 2023

XTab: Cross-table Pretraining for Tabular Transformers

Tabular deep learning: a comparative study applied to multi-task genome-wide prediction

Can a Deep Learning Model Be a Sure Bet for Tabular Prediction?

Unlocking the Transferability of Tokens in Deep Models for Tabular Data

Deep Feature Embedding for Tabular Data

TTNet: Tabular Transfer Network for Few-samples Prediction

HyperTab: Hypernetwork Approach for Deep Learning on Small Tabular Datasets