Chain-of-Table: Evolving Tables in the Reasoning Chain for Table Understanding

Zilong Wang,Hao Zhang,Chun-Liang Li,Julian Martin Eisenschlos,Vincent Perot,Zifeng Wang,Lesly Miculicich,Yasuhisa Fujii,Jingbo Shang,Chen-Yu Lee,Tomas Pfister
2024-01-19
Abstract:Table-based reasoning with large language models (LLMs) is a promising direction to tackle many table understanding tasks, such as table-based question answering and fact verification. Compared with generic reasoning, table-based reasoning requires the extraction of underlying semantics from both free-form questions and semi-structured tabular data. Chain-of-Thought and its similar approaches incorporate the reasoning chain in the form of textual context, but it is still an open question how to effectively leverage tabular data in the reasoning chain. We propose the Chain-of-Table framework, where tabular data is explicitly used in the reasoning chain as a proxy for intermediate thoughts. Specifically, we guide LLMs using in-context learning to iteratively generate operations and update the table to represent a tabular reasoning chain. LLMs can therefore dynamically plan the next operation based on the results of the previous ones. This continuous evolution of the table forms a chain, showing the reasoning process for a given tabular problem. The chain carries structured information of the intermediate results, enabling more accurate and reliable predictions. Chain-of-Table achieves new state-of-the-art performance on WikiTQ, FeTaQA, and TabFact benchmarks across multiple LLM choices.
Computation and Language
What problem does this paper attempt to address?
The paper primarily addresses issues in the field of natural language processing, specifically focusing on the understanding and reasoning of tabular data. Specifically, the researchers propose a new framework called CHAIN-OF-TABLE, which aims to form a "table chain" by progressively manipulating tabular data to help large language models (LLMs) better understand and solve table-based tasks, such as table question answering and fact verification. The paper points out that traditional table understanding methods have some limitations, such as the inability to effectively utilize information within the table structure for reasoning. Therefore, the researchers propose a new approach by defining a series of table operations (such as adding columns, selecting rows, etc.) and dynamically generating these operations under the guidance of a given question, thereby gradually constructing a table chain that includes intermediate results. This approach allows LLMs to progressively modify and update the table according to the current question's requirements, making it easier to arrive at the correct answer. The method was experimentally validated on several benchmark datasets, including WikiTQ, FeTaQA, and TabFact. The results show that CHAIN-OF-TABLE outperforms several existing methods on these tasks, including text-based Chain-of-Thought reasoning methods and program-assisted methods (such as generating SQL queries or Python scripts). Additionally, the paper analyzes the impact of different lengths of operation chains on performance and discusses the method's performance on input tables of varying sizes. In summary, the goal of this paper is to introduce a new multi-step reasoning framework in table understanding tasks to improve the accuracy and reliability of large language models in solving complex table problems.