A Communication Theory Perspective on Prompting Engineering Methods for Large Language Models

Yuanfeng Song, Yuanqin He, Xuefang Zhao, Hanlin Gu, Di Jiang, Haijun Yang, Lixin Fan, Qiang Yang
2023-10-24
Abstract: The emergence of Large Language Models (LLMs) has shifted the community from single-task-oriented natural language processing (NLP) research to a holistic end-to-end multi-task learning paradigm. Within this line of research, LLM-based prompting methods have attracted much attention, partly due to the technological advantages brought by prompt engineering (PE) and partly due to the underlying NLP principles disclosed by various prompting methods. Traditional supervised learning usually requires training a model on labeled data and then making predictions. In contrast, PE methods directly use the powerful capabilities of existing LLMs (e.g., GPT-3 and GPT-4) by composing appropriate prompts, especially under few-shot or zero-shot scenarios. Given the abundance of studies on prompting and the ever-evolving nature of this field, this article aims to (i) offer a novel perspective for reviewing existing PE methods within the well-established communication theory framework; (ii) facilitate a deeper understanding of the developing trends of existing PE methods used in four typical tasks; and (iii) shed light on promising research directions for future PE methods.
Computation and Language, Artificial Intelligence
What problem does this paper attempt to address?
The paper asks how, with the rise of large language models (LLMs), effective prompt engineering (PE) methods can make full use of these models' capabilities, especially in few-shot and zero-shot scenarios. Viewed through communication theory, the paper aims to:

1. **Provide a novel perspective**: Review existing prompt engineering techniques within the established communication theory framework.
2. **Deepen the understanding of existing PE methods**: Trace the development trends of PE methods used in four typical tasks, including method families such as prompt template engineering, answer engineering, and multi-round prompting.
3. **Explore future research directions**: Point out possible development paths for PE techniques that further reduce the misunderstanding of information between users and LLMs and improve LLM performance across tasks.

Specifically, by analyzing and summarizing these types of PE methods, the paper explores how to optimize them to make better use of LLM capabilities, especially in the multi-task learning paradigm. In communication-theory terms, it also discusses how LLM performance can be improved by reducing encoding errors and decoding errors, and by progressively reducing misunderstanding through multi-round interaction.
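
As a rough illustration of the three method families named above, the following toy sketch treats a prompt template as the encoding step, a word-to-label mapping as the decoding step, and a retry loop as multi-round interaction. It is not taken from the paper: `TEMPLATE`, `VERBALIZER`, `build_prompt`, `decode_answer`, `multi_round`, and the stand-in `toy_model` are all hypothetical names chosen for this example, and no real LLM API is called.

```python
from typing import Callable, Dict, Optional

# Hypothetical template: the "encoding" step that turns user intent into a prompt.
TEMPLATE = "Review: {text}\nOverall, the sentiment of this review is"

# Hypothetical verbalizer: the "decoding" step that maps model words to task labels.
VERBALIZER: Dict[str, str] = {
    "great": "positive",
    "good": "positive",
    "terrible": "negative",
    "bad": "negative",
}


def build_prompt(text: str) -> str:
    """Prompt template engineering: place the input into a fixed template."""
    return TEMPLATE.format(text=text)


def decode_answer(raw_output: str) -> Optional[str]:
    """Answer engineering: map the model's raw words onto the label space."""
    for word in raw_output.lower().split():
        label = VERBALIZER.get(word.strip(".,!?"))
        if label is not None:
            return label
    return None  # nothing in the reply could be decoded


def multi_round(text: str, model: Callable[[str], str], max_rounds: int = 3) -> Optional[str]:
    """Multi-round prompting: re-ask with a clarifying instruction until decodable."""
    prompt = build_prompt(text)
    for _ in range(max_rounds):
        raw = model(prompt)
        label = decode_answer(raw)
        if label is not None:
            return label
        # Feed the unusable reply back and narrow the expected answer format.
        prompt = f"{prompt} {raw}\nAnswer with one word, either great or terrible:"
    return None


def toy_model(prompt: str) -> str:
    """Stand-in for an LLM (an assumption for this sketch, not a real API)."""
    if "Answer with one word" in prompt:
        return "terrible" if "broke" in prompt else "great"
    return "Hmm, it is hard to say."


if __name__ == "__main__":
    print(multi_round("The charger broke after two days.", toy_model))
```

In this sketch the first reply cannot be decoded, so the second round adds an explicit answer format. That is one simple way to picture how repeated interaction can reduce misunderstanding between a user and a model.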