Integrating Planning into Single-Turn Long-Form Text Generation

Yi Liang, You Wu, Honglei Zhuang, Li Chen, Jiaming Shen, Yiling Jia, Zhen Qin, Sumit Sanghai, Xuanhui Wang, Carl Yang, Michael Bendersky

2024-10-09

Abstract: Generating high-quality, in-depth textual documents, such as academic papers, news articles, Wikipedia entries, and books, remains a significant challenge for Large Language Models (LLMs). In this paper, we propose to use planning to generate long-form content. To achieve our goal, we generate intermediate steps via an auxiliary task that teaches the LLM to plan, reason and structure before generating the final text. Our main novelty lies in a single auxiliary task that does not require multiple rounds of prompting or planning. To overcome the scarcity of training data for these intermediate steps, we leverage LLMs to generate synthetic intermediate writing data such as outlines, key information and summaries from existing full articles. Our experiments on datasets from two different domains, namely the scientific news dataset SciNews and the Wikipedia datasets KILT-Wiki and FreshWiki, demonstrate that LLMs fine-tuned with the auxiliary task generate higher-quality documents. We observe a +2.5% improvement in ROUGE-Lsum and a strong 3.60 overall win/loss ratio in human side-by-side (SxS) evaluation, with clear wins in organization, relevance, and verifiability.

Computation and Language, Artificial Intelligence

What problem does this paper attempt to address?

The problem this paper addresses is how to use large language models (LLMs) to generate high-quality, in-depth long-form documents such as academic papers, news articles, Wikipedia entries, and books. Although existing LLMs have made remarkable progress on many text-generation tasks, generating coherent and substantial long-form documents remains a major challenge. To this end, the paper proposes fine-tuning LLMs to generate the entire long-form document in a single call, which simplifies the generation process and improves the consistency of the resulting documents.

Specifically, the main contributions of the paper are as follows:

1. **Proposing a new method**: The method directly fine-tunes LLMs to generate a complete long-form document in one call, rather than through multiple rounds of prompting or planning.
2. **Introducing the pre-writing stage**: Inspired by human writing practices, the framework introduces an auxiliary training task that teaches the LLM to plan and structure a document before generating the final text.
3. **Generating synthetic intermediate writing data**: To overcome the scarcity of training data for intermediate writing steps, the paper uses LLMs to generate synthetic summaries, outlines, and key information from existing complete articles, which then serve as training data.
4. **Experimental results**: Experiments on the scientific news dataset SciNews and the Wikipedia datasets KILT-Wiki and FreshWiki show that LLMs fine-tuned with the auxiliary task generate higher-quality and more coherent long-form documents.

Through these contributions, the paper aims to improve the ability of LLMs to generate high-quality long-form documents, making them more suitable for practical applications.
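The data-construction idea described above can be illustrated with a minimal sketch. It is not the authors' pipeline: the task names, the section markers such as `[OUTLINE]`, the ordering of the intermediate steps, and the `stub_annotator` function (a rule-based stand-in for the LLM call that would produce the synthetic summary, outline, and key information) are all assumptions made for illustration. The sketch only shows how one existing article could yield both a direct-generation example and an auxiliary "plan, then write" example.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class TrainingExample:
    task: str    # hypothetical task name: "direct" or "plan_then_write"
    prompt: str
    target: str


def stub_annotator(article: str) -> Dict[str, str]:
    """Hypothetical stand-in for an LLM that derives pre-writing data.

    It simply reuses the first sentence of each paragraph; a real setup
    would prompt an LLM with the full article instead.
    """
    paragraphs = [p.strip() for p in article.split("\n\n") if p.strip()]
    first_sentences = [p.split(". ")[0].rstrip(".") for p in paragraphs]
    return {
        "summary": first_sentences[0] + "." if first_sentences else "",
        "outline": "\n".join(
            f"{i}. {s}" for i, s in enumerate(first_sentences, 1)
        ),
        "key_information": "; ".join(first_sentences),
    }


def build_examples(
    prompt: str,
    article: str,
    annotate: Callable[[str], Dict[str, str]] = stub_annotator,
) -> List[TrainingExample]:
    """Turn one (prompt, article) pair into two fine-tuning examples."""
    steps = annotate(article)
    # Assumed layout: intermediate steps come first, then the article,
    # so the model learns to plan before it writes within a single output.
    plan = (
        f"[SUMMARY]\n{steps['summary']}\n"
        f"[OUTLINE]\n{steps['outline']}\n"
        f"[KEY INFORMATION]\n{steps['key_information']}\n"
    )
    return [
        TrainingExample("direct", prompt, article),
        TrainingExample("plan_then_write", prompt, plan + "[ARTICLE]\n" + article),
    ]


if __name__ == "__main__":
    demo = "Cats sleep a lot. They nap often.\n\nDogs like walks. They fetch."
    for example in build_examples("Write about pets.", demo):
        print(example.task)
        print(example.target)
        print("---")
```

Because both examples share the same prompt and the same final article, the auxiliary example differs only in the planning text placed before the article, which is what lets a single model call produce the plan and the document together.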

Learning to Plan with Natural Language

Learning to Plan by Updating Natural Language

LLM+P: Empowering Large Language Models with Optimal Planning Proficiency

Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents

NATURAL PLAN: Benchmarking LLMs on Natural Language Planning

Can We Rely on LLM Agents to Draft Long-Horizon Plans? Let's Take TravelPlanner as an Example

Leveraging Environment Interaction for Automated PDDL Translation and Planning with Large Language Models

On the Planning Abilities of Large Language Models : A Critical Investigation

Understanding the Capabilities of Large Language Models for Automated Planning

Guiding Language Model Reasoning with Planning Tokens

Exploring and Benchmarking the Planning Capabilities of Large Language Models

Planning-Driven Programming: A Large Language Model Programming Workflow

Translating Natural Language to Planning Goals with Large-Language Models

Query-Efficient Planning with Language Models

Retrieve-Plan-Generation: An Iterative Planning and Answering Framework for Knowledge-Intensive LLM Generation

On the Planning Abilities of Large Language Models (A Critical Investigation with a Proposed Benchmark)

Large Language Models are Learnable Planners for Long-Term Recommendation

Large Language Models as Planning Domain Generators

ISR-LLM: Iterative Self-Refined Large Language Model for Long-Horizon Sequential Task Planning

EgoPlan-Bench2: A Benchmark for Multimodal Large Language Model Planning in Real-World Scenarios