Efficient Prompting Methods for Large Language Models: A Survey

Kaiyan Chang, Songcheng Xu, Chenglong Wang, Yingfeng Luo, Tong Xiao, Jingbo Zhu

2024-04-01

Abstract: Prompting has become a mainstream paradigm for adapting large language models (LLMs) to specific natural language processing tasks. While this approach opens the door to in-context learning of LLMs, it brings the additional computational burden of model inference and human effort of manual-designed prompts, particularly when using lengthy and complex prompts to guide and control the behavior of LLMs. As a result, the LLM field has seen a remarkable surge in efficient prompting methods. In this paper, we present a comprehensive overview of these methods. At a high level, efficient prompting methods can broadly be categorized into two approaches: prompting with efficient computation and prompting with efficient design. The former involves various ways of compressing prompts, and the latter employs techniques for automatic prompt optimization. We present the basic concepts of prompting, review the advances for efficient prompting, and highlight future research directions.

Computation and Language

What problem does this paper attempt to address?

### Problems the Paper Aims to Solve

This paper aims to address the efficiency issues faced when using prompts in specific natural language processing tasks with large language models (LLMs). Specifically, the paper focuses on the following two main problems:

1. **Computational Burden**: Long and complex prompts increase the computational cost of model inference. The model's performance is particularly affected when it needs to handle a large amount of contextual information.
2. **Manual Design Cost**: Designing high-quality prompts manually requires a significant amount of time and effort, especially when creating complex and detailed prompts.

To address these issues, the paper provides a comprehensive review of existing efficient prompting methods and categorizes them into two main types:

- **Efficient Computational Prompts**: These methods aim to reduce the consumption of computational resources by compressing prompts. Techniques include knowledge distillation, encoding, and filtering.
- **Efficient Design Prompts**: These methods aim to improve efficiency by automatically optimizing prompt design. Techniques include gradient-based methods and intelligent algorithm-based methods.

Through these methods, the paper hopes to provide researchers and developers with effective strategies to save financial and human resources, thereby promoting the widespread use of large language models in academic research and commercial applications.
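To make the filtering idea concrete, here is a minimal sketch of prompt compression by dropping low-information words. It is an illustration only, not the method of the survey or of any specific system. The function name `compress_prompt`, the `keep_ratio` parameter, and the scoring rule are all assumptions: self-information is estimated from word frequencies inside the prompt itself, whereas filtering approaches in practice usually score tokens with a small language model.

```python
import math
from collections import Counter


def compress_prompt(prompt: str, keep_ratio: float = 0.5) -> str:
    """Toy filtering-based compression: keep the most 'informative' words.

    Assumption: informativeness is the self-information -log p(w), with p(w)
    estimated from word counts inside the prompt itself. This is a stand-in
    for the language-model-based scores a real filtering method would use.
    """
    words = prompt.split()
    if not words:
        return prompt

    counts = Counter(w.lower() for w in words)
    total = len(words)
    # Rarer words get a higher score.
    scores = [-math.log(counts[w.lower()] / total) for w in words]

    n_keep = max(1, round(len(words) * keep_ratio))
    # Rank positions by score; ties are broken by original position.
    ranked = sorted(range(len(words)), key=lambda i: (-scores[i], i))
    # Restore the original word order for the kept positions.
    kept = sorted(ranked[:n_keep])
    return " ".join(words[i] for i in kept)


if __name__ == "__main__":
    print(compress_prompt("the cat and the dog and the bird", keep_ratio=0.5))
```

The trade-off that `keep_ratio` exposes is the one the summary describes: a shorter prompt lowers inference cost, but dropping too many words risks removing context the model needs.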

Towards Goal-oriented Prompt Engineering for Large Language Models: A Survey

A Communication Theory Perspective on Prompting Engineering Methods for Large Language Models

Prompt Compression for Large Language Models: A Survey

A Survey of Prompt Engineering Methods in Large Language Models for Different NLP Tasks

An Empirical Categorization of Prompting Techniques for Large Language Models: A Practitioner's Guide

Automatic Prompt Selection for Large Language Models

A Systematic Survey of Prompt Engineering in Large Language Models: Techniques and Applications

Visual Prompting in Multimodal Large Language Models: A Survey

Prompting Frameworks for Large Language Models: A Survey

Are Large Language Models Good Prompt Optimizers?

Prompting Is Programming: A Query Language for Large Language Models

Towards Generalist Prompting for Large Language Models by Mental Models

The language of prompting: What linguistic properties make a prompt successful?

A Survey on Prompting Techniques in LLMs

Large Language Models are Good Multi-lingual Learners : When LLMs Meet Cross-lingual Prompts

PromptAid: Prompt Exploration, Perturbation, Testing and Iteration using Visual Analytics for Large Language Models

Exploring Prompt Engineering Practices in the Enterprise

LLMLingua: Compressing Prompts for Accelerated Inference of Large Language Models

SPRIG: Improving Large Language Model Performance by System Prompt Optimization