Abstract: Fine-tuning large language models (LLMs) with a collection of large and diverse instructions has improved the model's generalization to different tasks, even for unseen tasks. However, most existing instruction datasets include only single instructions, and models trained on them struggle to follow complex instructions composed of multiple subtasks. In this work, we propose a novel concept of compositional instructions called chain-of-instructions (CoI), where the output of one instruction becomes an input for the next like a chain. Unlike the conventional practice of solving single instruction tasks, our proposed method encourages a model to solve each subtask step by step until the final answer is reached. CoI-tuning (i.e., fine-tuning with CoI instructions) improves the model's ability to handle instructions composed of multiple subtasks as well as unseen composite tasks such as multilingual summarization. Overall, our study finds that simple CoI tuning of existing instruction data can provide consistent generalization to solve more complex, unseen, and longer chains of instructions.

What problem does this paper attempt to address?

This paper addresses the limited ability of large language models (LLMs) to handle complex instructions. Most existing instruction datasets contain only single instructions, so models tuned on them have difficulty with instructions composed of multiple subtasks. The paper therefore proposes "Chain-of-Instructions (CoI)", in which a complex instruction is expressed as a series of interrelated subtasks, to improve the model's handling of such instructions, including unseen compound tasks such as multilingual summarization. The main contributions of the paper are:

1. **Introducing the Chain-of-Instructions (CoI) task**: A new benchmark dataset is created to evaluate the ability of LLMs to follow compound instructions.
2. **Developing a framework**: It automatically constructs compound instruction datasets, reducing the need for human supervision. The framework applies in-context learning to existing single-instruction datasets to generate CoI data.
3. **Proposing a method**: It enables LLMs to solve compound tasks in an interpretable way, for example by generating outputs step by step when handling complex task chains.
4. **Experimental verification**: Experiments and analysis show that CoI-tuned models outperform the baseline models on single instructions and sequential compound instructions, and also perform better on unseen longer-chain test sets and downstream tasks.

Through these contributions, the paper aims to enhance the ability of LLMs to handle complex, compound instruction tasks, thereby expanding their potential in practical applications.
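The chaining idea, where the output of one subtask becomes the input of the next, can be illustrated with a minimal Python sketch. This is not the paper's implementation: the names `Subtask`, `compose_instruction`, and `run_chain` are hypothetical, the "Step n:" format is an assumed layout rather than the paper's prompt template, and the `solve` callables are simple deterministic stand-ins for what would be model calls.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Subtask:
    """One single instruction plus a stand-in solver (hypothetical)."""
    instruction: str
    solve: Callable[[str], str]  # assumption: replaces an actual LLM call


def compose_instruction(subtasks: List[Subtask]) -> str:
    """Join single instructions into one compositional instruction.

    The "Step n:" layout is an illustrative assumption.
    """
    return "\n".join(
        f"Step {i}: {s.instruction}" for i, s in enumerate(subtasks, start=1)
    )


def run_chain(subtasks: List[Subtask], source: str) -> List[str]:
    """Solve subtasks in order; each output is the next subtask's input."""
    outputs = []
    current = source
    for s in subtasks:
        current = s.solve(current)
        outputs.append(current)  # keep intermediate outputs for inspection
    return outputs


def first_sentence(text: str) -> str:
    # Toy solver: keep everything up to the first period.
    return text.split(".")[0].strip() + "."


chain = [
    Subtask("Extract the first sentence of the input.", first_sentence),
    Subtask("Convert the previous output to uppercase.", str.upper),
]

outputs = run_chain(chain, "CoI chains subtasks. Each output feeds the next.")
print(compose_instruction(chain))
print(outputs)
```

Keeping every intermediate output, rather than only the final one, mirrors the step-by-step behavior described above and makes it easy to see which link of the chain failed.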

Chain-of-Instructions: Compositional Instruction Tuning on Large Language Models

Fine-tuning Large Language Models with Sequential Instructions

CoIN: A Benchmark of Continual Instruction tuNing for Multimodel Large Language Model

Demystifying Instruction Mixing for Fine-tuning Large Language Models

Contrastive Instruction Tuning

TaCIE: Enhancing Instruction Comprehension in Large Language Models through Task-Centred Instruction Evolution

Conifer: Improving Complex Constrained Instruction-Following Ability of Large Language Models

Scaling Instruction-Finetuned Language Models

Dual Instruction Tuning with Large Language Models for Mathematical Reasoning

CoMMIT: Coordinated Instruction Tuning for Multimodal Large Language Models

Instruction Tuning for Large Language Models: A Survey

BioInstruct: Instruction Tuning of Large Language Models for Biomedical Natural Language Processing

Facilitating Multi-turn Function Calling for LLMs via Compositional Instruction Tuning

Learning to Perform Complex Tasks through Compositional Fine-Tuning of Language Models

Evaluating Instruction-Tuned Large Language Models on Code Comprehension and Generation

InstructCoder: Instruction Tuning Large Language Models for Code Editing

Instruction Mining: Instruction Data Selection for Tuning Large Language Models

What Makes for Good Visual Instructions? Synthesizing Complex Visual Reasoning Instructions for Visual Instruction Tuning

Benchmarking Large Language Models on Controllable Generation under Diversified Instructions

The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning

LLaMoCo: Instruction Tuning of Large Language Models for Optimization Code Generation