Abstract:Text-to-Table aims to generate structured tables to convey the key information from unstructured documents. Existing text-to-table datasets are typically oriented English, limiting the research in non-English languages. Meanwhile, the emergence of large language models (LLMs) has shown great success as general task solvers in multi-lingual settings (e.g., ChatGPT), theoretically enabling text-to-table in other languages. In this paper, we propose a Chinese text-to-table dataset, CT-Eval, to benchmark LLMs on this task. Our preliminary analysis of English text-to-table datasets highlights two key factors for dataset construction: data diversity and data hallucination. Inspired by this, the CT-Eval dataset selects a popular Chinese multidisciplinary online encyclopedia as the source and covers 28 domains to ensure data diversity. To minimize data hallucination, we first train an LLM to judge and filter out the task samples with hallucination, then employ human annotators to clean the hallucinations in the validation and testing sets. After this process, CT-Eval contains 88.6K task samples. Using CT-Eval, we evaluate the performance of open-source and closed-source LLMs. Our results reveal that zero-shot LLMs (including GPT-4) still have a significant performance gap compared with human judgment. Furthermore, after fine-tuning, open-source LLMs can significantly improve their text-to-table ability, outperforming GPT-4 by a large margin. In short, CT-Eval not only helps researchers evaluate and quickly understand the Chinese text-to-table ability of existing LLMs but also serves as a valuable resource to significantly improve the text-to-table performance of LLMs.

The ICT , CAS MT Systems for the IWSLT 09 Evaluation

The ICT System Description for IWSLT 2008

The ICT Statistical Machine Translation Systems for IWSLT 2007.

The TCH Machine Translation System for IWSLT 2008.

ICT System Description for the 2006 TC-STAR Run #2 SLT Evaluation

ISCAS at NTCIR-3: Monolingual, Bilingual and MultiLingual IR Tasks

The MSRA Machine Translation System for IWSLT 2010.

The Sogou-TIIC Speech Translation System for IWSLT 2018.

LIT Team's System Description for Japanese-Chinese Machine Translation Task in IWSLT 2020.

WIA-Opinmine System in NTCIR-8 MOAT Evaluation.

Exploring Lexical, Syntactic, and Semantic Features for Chinese Textual Entailment in NTCIR RITE Evaluation Tasks

SG01 at the NTCIR-13 STC-2 Task

The WISTON Text to Speech System for Blizzard 2008

I4U System Description for NIST SRE'20 CTS Challenge

CT-Eval: Benchmarking Chinese Text-to-Table Performance in Large Language Models

Tencent AI Lab - Shanghai Jiao Tong University Low-Resource Translation System for the WMT22 Translation Task

An In-depth Study on Internal Structure of Chinese Words

CMU's IWSLT 2024 Simultaneous Speech Translation System

The USTC-NEL Speech Translation system at IWSLT 2018

Multi-choice Question Answering System of WIP at the NTIR-12 QA Lab-2.

An Improved Traditional Chinese Evaluation Suite for Foundation Model