Abstract:Tabular data, widely used in industries like healthcare, finance, and transportation, presents unique challenges for deep learning due to its heterogeneous nature and lack of spatial structure. This survey reviews the evolution of deep learning models for tabular data, from early fully connected networks (FCNs) to advanced architectures like TabNet, SAINT, TabTranSELU, and MambaNet. These models incorporate attention mechanisms, feature embeddings, and hybrid architectures to address tabular data complexities. TabNet uses sequential attention for instance-wise feature selection, improving interpretability, while SAINT combines self-attention and intersample attention to capture complex interactions across features and data points, both advancing scalability and reducing computational overhead. Hybrid architectures such as TabTransformer and FT-Transformer integrate attention mechanisms with multi-layer perceptrons (MLPs) to handle categorical and numerical data, with FT-Transformer adapting transformers for tabular datasets. Research continues to balance performance and efficiency for large datasets. Graph-based models like GNN4TDL and GANDALF combine neural networks with decision trees or graph structures, enhancing feature representation and mitigating overfitting in small datasets through advanced regularization techniques. Diffusion-based models like the Tabular Denoising Diffusion Probabilistic Model (TabDDPM) generate synthetic data to address data scarcity, improving model robustness. Similarly, models like TabPFN and Ptab leverage pre-trained language models, incorporating transfer learning and self-supervised techniques into tabular tasks. This survey highlights key advancements and outlines future research directions on scalability, generalization, and interpretability in diverse tabular data applications.

What problem does this paper attempt to address?

### Problems the paper attempts to solve The paper "A Survey on Deep Tabular Learning" aims to solve the following problems: 1. **Small sample size**: - Many tabular datasets are relatively small, especially when compared with large - scale image or text datasets, which causes complex deep - learning models to be prone to overfitting. - A small sample size makes it difficult for the model to learn generalized patterns and instead makes it more likely to remember the specific details of the training data. 2. **High - dimensionality**: - Tabular data usually contains many features, which may be sparse or irrelevant, making it difficult for the model to identify meaningful patterns. - High - dimensional data increases the complexity of the model, making it more difficult to train and optimize. 3. **Complex feature interactions**: - Unlike the local structures common in image or text data, feature interactions in tabular data are non - local and complex. - Such complex interaction relationships require more specialized architectures to be effectively captured, and traditional deep - learning models perform poorly in this regard. 4. **Heterogeneous feature types**: - Tabular data usually contains multiple data types, such as numerical, categorical, ordinal, textual, and even multimedia elements (such as images and emojis). - Handling these different types of features is a key challenge in improving the performance of deep - learning models on tabular data. 5. **Non - spatial relationships**: - Traditional deep - learning models (such as convolutional neural networks and recurrent neural networks) perform well in processing data with spatial or temporal structures (such as images and text), but tabular data lacks such inherent spatial or temporal structures. - The relative positions of features have no clear meaning, making traditional deep - learning models less effective in processing tabular data. 6. **Overfitting problem**: - Deep - learning models are prone to overfitting on datasets with small sample sizes, that is, the model fits the training data too precisely and performs poorly on new data. - Regularization techniques, transfer learning, and other methods need to be adopted to alleviate this problem. ### Solutions To address the above challenges, the paper reviews the progress in the field of deep learning for tabular data in recent years, especially the following types of models and techniques: 1. **Fully - connected networks (FCNs)**: - The earliest attempt, which, although simple, shows the potential of deep learning on tabular data. 2. **Attention mechanisms**: - For example, TabNet performs instance - level feature selection through a sequential attention mechanism, improving the interpretability of the model. - SAINT combines self - attention and inter - sample attention to capture complex interactions between features and data points, enhancing the model's scalability and computational efficiency. 3. **Hybrid architectures**: - For example, TabTransformer and FT - Transformer combine attention mechanisms with multi - layer perceptrons (MLPs) to handle categorical and numerical data and adapt to the characteristics of tabular datasets. - FT - Transformer shows excellent performance by adapting the Transformer model to handle tabular data. 4. **Graph neural networks (GNNs)**: - For example, GNN4TDL and GANDALF combine neural networks with decision trees or graph structures to enhance feature representations and alleviate the overfitting problem of small datasets through advanced regularization techniques. 5. **Diffusion models**: - For example, the Tabular Denoising Diffusion Probabilistic Model (TabDDPM) generates synthetic data to solve the data scarcity problem and improve the robustness of the model. 6. **Pre - trained language models**: - For example, TabPFN and Ptab utilize pre - trained language models to introduce transfer learning and self - supervision techniques into tabular tasks, improving the performance of the model. ### Conclusion By reviewing the development of these models and techniques, the paper summarizes the key progress in the field of deep learning for tabular data and points out future research directions, especially further exploration in terms of scalability, generalization ability, and interpretability. These innovations not only solve the unique challenges of tabular data but also provide new possibilities for the processing of complex, non - spatial data in practical applications.

A Survey on Deep Tabular Learning

Embeddings for Tabular Data: A Survey

Graph Neural Networks for Tabular Data Learning: A Survey with Taxonomy and Directions

Transfer Learning with Deep Tabular Models

TabularNet: A Neural Network Architecture for Understanding Semantic Structures of Tabular Data

Revisiting Deep Learning Models for Tabular Data

Deep Learning with Tabular Data: A Self-supervised Approach

MambaTab: A Plug-and-Play Model for Learning Tabular Data

Language Modeling on Tabular Data: A Survey of Foundations, Techniques and Evolution

Tabular Data: Is Attention All You Need?

Tabular Transformers for Modeling Multivariate Time Series

TabReD: Analyzing Pitfalls and Filling the Gaps in Tabular Deep Learning Benchmarks

PTab: Using the Pre-trained Language Model for Modeling Tabular Data

TabGSL: Graph Structure Learning for Tabular Data Prediction

Stable and Interpretable Deep Learning for Tabular Data: Introducing InterpreTabNet with the Novel InterpreStability Metric

Large Language Models(LLMs) on Tabular Data: Prediction, Generation, and Understanding -- A Survey

Deep Feature Embedding for Tabular Data

On the Efficiency of NLP-Inspired Methods for Tabular Deep Learning

InterpreTabNet: Distilling Predictive Signals from Tabular Data by Salient Feature Interpretation

Time Sequence Deep Learning Model for Ubiquitous Tabular Data with Unique 3D Tensors Manipulation

DANets: Deep Abstract Networks for Tabular Data Classification and Regression