From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large Language Models

Qianyu He,Jie Zeng,Qianxi He,Jiaqing Liang,Yanghua Xiao

2024-06-18

Abstract:It is imperative for Large language models (LLMs) to follow instructions with elaborate requirements (i.e. Complex Instructions Following). Yet, it remains under-explored how to enhance the ability of LLMs to follow complex instructions with multiple constraints. To bridge the gap, we initially study what training data is effective in enhancing complex constraints following abilities. We found that training LLMs with instructions containing multiple constraints enhances their understanding of complex instructions, especially those with lower complexity levels. The improvement can even generalize to compositions of out-of-domain constraints. Additionally, we further propose methods addressing how to obtain and utilize the effective training data. Finally, we conduct extensive experiments to prove the effectiveness of our methods in terms of overall performance and training efficiency. We also demonstrate that our methods improve models' ability to follow instructions generally and generalize effectively across out-of-domain, in-domain, and adversarial settings, while maintaining general capabilities.

Computation and Language

What problem does this paper attempt to address?

The problem this paper attempts to address is: How to enhance the multi-constraint instruction-following ability of large language models (LLMs) when faced with complex instructions. Specifically, the paper explores the following two main questions: 1. **What kind of training data can effectively improve the ability to follow complex instructions?** - Through empirical research, the paper finds that training LLMs with instructions containing multiple constraints (i.e., composite data) is more effective in improving the model's understanding of complex instructions compared to training with instructions containing a single constraint (i.e., atomic data), especially when dealing with low-complexity instructions. 2. **How to obtain and utilize effective training data?** - The paper proposes a technique based on a discriminative method to generate high-quality composite data and introduces a contrastive method that uses positive and negative samples for reinforcement learning fine-tuning to further enhance the model's ability to follow complex instructions. Through these studies, the paper aims to systematically improve the ability of LLMs to handle complex instructions in practical applications, ensuring that the model maintains good generalization performance across various domains and adversarial settings.

From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large Language Models

Conifer: Improving Complex Constrained Instruction-Following Ability of Large Language Models

Constraint Back-translation Improves Complex Instruction Following of Large Language Models

Benchmarking Complex Instruction-Following with Multiple Constraints Composition

Can Large Language Models Understand Real-World Complex Instructions?

FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models

Divide-Verify-Refine: Aligning LLM Responses with Complex Instructions

Evaluating Large Language Models at Evaluating Instruction Following

Instruction Position Matters in Sequence Generation with Large Language Models

Multi-IF: Benchmarking LLMs on Multi-Turn and Multilingual Instructions Following

Beyond Instruction Following: Evaluating Inferential Rule Following of Large Language Models

LIFBench: Evaluating the Instruction Following Performance and Stability of Large Language Models in Long-Context Scenarios

Nevermind: Instruction Override and Moderation in Large Language Models

Diverse and Fine-Grained Instruction-Following Ability Exploration with Synthetic Data

WizardLM: Empowering Large Pre-Trained Language Models to Follow Complex Instructions

Chain-of-Instructions: Compositional Instruction Tuning on Large Language Models

Do Large Language Models Have Compositional Ability? An Investigation into Limitations and Scalability

InFoBench: Evaluating Instruction Following Ability in Large Language Models

Parrot: Enhancing Multi-Turn Instruction Following for Large Language Models

CFBench: A Comprehensive Constraints-Following Benchmark for LLMs

RNR: Teaching Large Language Models to Follow Roles and Rules