Abstract: Data-to-text (D2T) generation aims to generate human-readable text from semi-structured data, such as tables and graphs. The recent success of D2T is largely attributed to advances in large language models (LLMs). Despite this success, no prior work has examined how model size affects the performance of fine-tuned LLMs on D2T tasks. D2T model performance is typically assessed on three key qualities: \textit{readability} (fluency and coherence), \textit{informativeness} (content similarity), and \textit{faithfulness} (consistency of factual information). It remains unclear whether increasing LLM size improves D2T performance across all three qualities. This study investigates how the performance of fine-tuned LLMs on D2T tasks varies with model size. Through extensive comparative analysis, we aim to elucidate both the advantages and limitations of scaling model size across five widely used D2T datasets (E2E, ViGGo, WikiTableText, DART, and WebNLG) and twelve state-of-the-art LLMs of varying sizes from five LLM families (T5, BART, OPT, BLOOM, and Llama 2). To cover all three essential qualities of D2T models, we use six widely recognized automatic metrics: \textsc{BLEU}, \textsc{METEOR}, \textsc{BERTScore}, \textsc{MoverScore}, \textsc{Parent}, and \textsc{BARTScore}. We also provide an in-depth analysis of how LLM performance varies with model size in the presence of source-reference divergence, a critical aspect of D2T tasks. Our investigation reveals that increasing LLM size enhances \textit{readability} and \textit{informativeness} in D2T tasks, but larger LLMs may sacrifice \textit{faithfulness}. Moreover, smaller LLMs are more resilient than larger ones when source-reference divergence is present.

CMMQC: Cascaded Multi-Model Quality Control for Unsupervised Data-to-Text Generation

Data Quality Enhancement on the Basis of Diversity with Large Language Models for Text Classification: Uncovered, Difficult, and Noisy

UniGen: A Unified Framework for Textual Dataset Generation Using Large Language Models

Unveiling Large Language Models Generated Texts: A Multi-Level Fine-Grained Detection Framework

Text2Data: Low-Resource Data Generation with Textual Control

Controlled Text Generation for Large Language Model with Dynamic Attribute Graphs

A Semi-Supervised Approach for Low-Resourced Text Generation

Teach LLMs to Personalize -- An Approach inspired by Writing Education

Beyond Traditional Benchmarks: Analyzing Behaviors of Open LLMs on Data-to-Text Generation

Controllable Text Generation for Large Language Models: A Survey

Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs

A Lightweight Multi Aspect Controlled Text Generation Solution For Large Language Models

Curriculum Learning with Quality-Driven Data Selection

Exploration of Masked and Causal Language Modelling for Text Generation

Partially-Aligned Data-to-Text Generation with Distant Supervision

Impact of Model Size on Fine-tuned LLM Performance in Data-to-Text Generation: A State-of-the-Art Investigation

Unlocking Anticipatory Text Generation: A Constrained Approach for Large Language Models Decoding

How Do Seq2Seq Models Perform on End-to-End Data-to-Text Generation?

TextMachina: Seamless Generation of Machine-Generated Text Datasets

Curriculum-Based Self-Training Makes Better Few-Shot Learners for Data-to-Text Generation

SILMM: Self-Improving Large Multimodal Models for Compositional Text-to-Image Generation