Table-GPT: Table-tuned GPT for Diverse Table Tasks

Peng Li,Yeye He,Dror Yashar,Weiwei Cui,Song Ge,Haidong Zhang,Danielle Rifinski Fainman,Dongmei Zhang,Surajit Chaudhuri
2023-10-14
Abstract:Language models, such as GPT-3.5 and ChatGPT, demonstrate remarkable abilities to follow diverse human instructions and perform a wide range of tasks. However, when probing language models using a range of basic table-understanding tasks, we observe that today's language models are still sub-optimal in many table-related tasks, likely because they are pre-trained predominantly on \emph{one-dimensional} natural-language texts, whereas relational tables are \emph{two-dimensional} objects. In this work, we propose a new "\emph{table-tuning}" paradigm, where we continue to train/fine-tune language models like GPT-3.5 and ChatGPT, using diverse table-tasks synthesized from real tables as training data, with the goal of enhancing language models' ability to understand tables and perform table tasks. We show that our resulting Table-GPT models demonstrate (1) better \emph{table-understanding} capabilities, by consistently outperforming the vanilla GPT-3.5 and ChatGPT, on a wide-range of table tasks, including holdout unseen tasks, and (2) strong \emph{generalizability}, in its ability to respond to diverse human instructions to perform new table-tasks, in a manner similar to GPT-3.5 and ChatGPT.
Computation and Language,Artificial Intelligence,Databases
What problem does this paper attempt to address?
The paper primarily addresses the limitations of current large language models (such as GPT-3.5 and ChatGPT) in handling tabular data by proposing a new approach. The authors observed that although these models perform excellently on natural language tasks, they still have shortcomings in understanding tabular data. This is mainly because they are pre-trained on one-dimensional natural language text, whereas tabular data is inherently two-dimensional. To solve this problem, the paper proposes a new model called "Table-GPT," which enhances the language model's ability to understand tables through a method called "table-tuning." Specifically, this method involves continuing the training of existing language models (e.g., GPT-3.5 or ChatGPT) using a variety of tabular tasks synthesized from real tables as training data. The goal is to improve the model's ability to understand and execute table-related tasks. The main contributions of the paper are as follows: 1. **Proposed a new "table-tuning" paradigm**: Utilizing a large number of diverse tabular tasks synthesized from real tables to continue training the language model, enhancing its ability to handle tabular data. 2. **Developed various data augmentation techniques**: Including task-level, table-level, instruction-level, and completion-level data augmentation techniques to ensure the generalization ability of the Table-GPT model and avoid overfitting. 3. **Demonstrated the strong performance of the Table-GPT model**: It significantly outperformed the non-tuned GPT-3.5 and ChatGPT models on various known and unknown tabular tasks and adapted well to new tasks, similar to ChatGPT's flexibility in handling natural language tasks. In summary, this paper aims to improve the ability of language models to handle tabular data and experimentally validates the effectiveness and superiority of the proposed Table-GPT model.