Are LLMs Effective Backbones for Fine-tuning? An Experimental Investigation of Supervised LLMs on Chinese Short Text Matching

Shulin Liu,Chengcheng Xu,Hao Liu,Tinghao Yu,Tao Yang
2024-03-29
Abstract:The recent success of Large Language Models (LLMs) has garnered significant attention in both academia and industry. Prior research on LLMs has primarily focused on enhancing or leveraging their generalization capabilities in zero- and few-shot settings. However, there has been limited investigation into effectively fine-tuning LLMs for a specific natural language understanding task in supervised settings. In this study, we conduct an experimental analysis by fine-tuning LLMs for the task of Chinese short text matching. We explore various factors that influence performance when fine-tuning LLMs, including task modeling methods, prompt formats, and output formats.
Computation and Language
What problem does this paper attempt to address?
The paper discusses the effectiveness of using large language models (LLMs) for fine-tuning specific natural language understanding tasks, especially Chinese short text matching, in a supervised setting. The study analyzes the impact of factors such as task modeling approaches, prompt formats, and output formats on the fine-tuning performance through experiments. The results show that generative paradigms outperform discriminative paradigms, chain-of-thought (CoT) is beneficial for matching tasks with limited data, and LLMs are insensitive to prompts in supervised learning.