Tabular Transfer Learning via Prompting LLMs

Jaehyun Nam, Woomin Song, Seong Hyeon Park, Jihoon Tack, Sukmin Yun, Jaehyung Kim, Kyu Hwan Oh, Jinwoo Shin
2024-08-09
Abstract: Learning with a limited number of labeled data is a central problem in real-world applications of machine learning, as it is often expensive to obtain annotations. To deal with the scarcity of labeled data, transfer learning is a conventional approach; it learns transferable knowledge by training a neural network on multiple other sources. In this paper, we investigate transfer learning of tabular tasks, which has been less studied and less successful in the literature than in other domains, e.g., vision and language. This is because tables are inherently heterogeneous, i.e., they contain different columns and feature spaces, making transfer learning difficult. On the other hand, recent advances in natural language processing suggest that the label scarcity issue can be mitigated by utilizing the in-context learning capability of large language models (LLMs). Inspired by this and the fact that LLMs can also process tables within a unified language space, we ask whether LLMs can be effective for tabular transfer learning, in particular, under scenarios where the source and target datasets have different formats. As a positive answer, we propose a novel tabular transfer learning framework, coined Prompt to Transfer (P2T), that utilizes unlabeled (or heterogeneous) source data with LLMs. Specifically, P2T identifies a column feature in a source dataset that is strongly correlated with a target task feature to create examples relevant to the target task, thus creating pseudo-demonstrations for prompts. Experimental results demonstrate that P2T outperforms previous methods on various tabular learning benchmarks, showing good promise for the important, yet underexplored tabular transfer learning problem. Code is available at https://github.com/jaehyun513/P2T.
Computation and Language, Artificial Intelligence, Machine Learning
What problem does this paper attempt to address?
### Problems Addressed by the Paper

This paper addresses a core issue in machine learning: how to learn effectively from limited labeled data. Specifically, it focuses on transfer learning for tabular data. Tables typically have different columns and feature spaces, which makes conventional transfer learning methods difficult to apply.

### Main Contributions

1. **Proposing the P2T Framework**:
   - **Background**: Transfer learning has been studied extensively in fields such as vision and language, but work on tabular data is relatively scarce. The heterogeneity of tabular data (different columns and feature spaces) makes conventional transfer learning methods hard to apply directly.
   - **Solution**: The paper proposes a new framework called "Prompt to Transfer" (P2T), which leverages the in-context learning capability of large language models (LLMs) to address transfer learning for tabular data. P2T identifies a column feature in the source dataset that is strongly correlated with the target task feature and uses it to create pseudo-demonstrations for the prompt, thereby transferring knowledge to the target task.
2. **Experimental Validation**:
   - **Zero-Shot Learning**: In zero-shot scenarios, P2T improves prediction performance when unlabeled or heterogeneous data is used as the transfer source.
   - **Few-Shot Learning**: In few-shot scenarios, P2T further improves prediction accuracy by utilizing unlabeled or heterogeneous data.
3. **Comparative Experiments**:
   - Compared to existing methods (such as self-supervised learning and unsupervised meta-learning methods), P2T achieves better results on various tabular learning benchmarks, particularly when leveraging unlabeled and heterogeneous data.
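The pseudo-demonstration idea described under the P2T framework can be illustrated with a minimal sketch. This is not the authors' implementation (see the linked repository for that). It assumes the strongly correlated source column has already been chosen and is passed in as `pseudo_label_col`; the function names, the "column is value" serialization, the question template, and the toy rows are all hypothetical choices made for illustration.

```python
from typing import Dict, List

Row = Dict[str, object]


def serialize(row: Row, skip: str = "") -> str:
    """Turn a table row into a text string, omitting the `skip` column."""
    return ", ".join(f"{col} is {val}" for col, val in row.items() if col != skip)


def build_pseudo_demonstrations(source_rows: List[Row], pseudo_label_col: str) -> List[str]:
    """Treat `pseudo_label_col` as the value to predict from the other columns.

    Assumption: the caller has already picked a source column that is
    strongly correlated with the target task feature.
    """
    demos = []
    for row in source_rows:
        features = serialize(row, skip=pseudo_label_col)
        demos.append(f"{features}. What is {pseudo_label_col}? {row[pseudo_label_col]}")
    return demos


def build_prompt(demos: List[str], target_row: Row, target_col: str) -> str:
    """Concatenate the pseudo-demonstrations and append the unanswered target query."""
    query = f"{serialize(target_row)}. What is {target_col}?"
    return "\n".join(demos + [query])


# Toy, made-up rows for illustration only (not from the paper).
source = [
    {"age": 63, "blood pressure": 150, "glucose": 180},
    {"age": 29, "blood pressure": 110, "glucose": 85},
]
target = {"age": 51, "blood pressure": 140}

demos = build_pseudo_demonstrations(source, pseudo_label_col="glucose")
prompt = build_prompt(demos, target, target_col="diabetes")
print(prompt)
```

The resulting string would then be sent to an LLM, which answers the final question in context. Because every row is serialized to text, the source and target tables do not need to share the same columns.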
### Conclusion

By proposing the P2T framework, this paper tackles the heterogeneity problem in transfer learning for tabular data and demonstrates the potential of large language models in scenarios with limited labeled data. Experimental results show that P2T outperforms previous methods in both zero-shot and few-shot settings, offering a new direction for tabular transfer learning.