Reverse Prompt Engineering

Hanqing Li,Diego Klabjan

2024-11-11

Abstract:This paper explores a new black-box, zero-shot language model inversion problem and proposes an innovative framework for prompt reconstruction using only text outputs from a language model. Leveraging a large language model alongside an optimization algorithm, the proposed method effectively recovers prompts with minimal resources. Experimental results on several datasets derived from public sources indicate that the proposed approach achieves high-quality prompt recovery and generates prompts more similar to the originals than current state-of-the-art methods. Additionally, the use-case study demonstrates the method's strong potential for generating high-quality text data.

Computation and Language

What problem does this paper attempt to address?

The problem that this paper attempts to solve is to reverse - engineer and recover the original input prompt from the text output generated by large - language models (LLMs) under black - box and zero - sample conditions. Specifically, the researchers proposed a new method - Reverse Prompt Engineering (RPE), which can infer the original prompt by using the reasoning ability of the target LLM and combining an iterative optimization algorithm when only a small amount of text output is provided. Compared with previous methods, RPE does not require access to the internal parameters of the model or a large amount of output data, nor does it require an additional training process. This makes RPE particularly suitable for dealing with closed - source LLMs such as GPT - 3.5. In addition, RPE shows higher accuracy and efficiency in prompt recovery, and its average cosine similarity on different embedding models is 5.2% higher than that of the current state - of - the - art method. The main contributions of the paper are: - Providing the first research on the reverse problem of language models under black - box and zero - sample conditions. - Proposing an innovative method to recover prompts from text output using only LLMs. - Designing a novel optimization algorithm that uses the LLM itself as an optimizer to improve the accuracy of prompt recovery. Verified by experiments, RPE not only has technological breakthroughs but also shows great potential in practical applications, such as being used to generate high - quality text data, such as marketing plans, video game designs, and lyric writing.

Reverse Prompt Engineering

Extracting Prompts by Inverting LLM Outputs

Prompt Stealing Attacks Against Large Language Models

Effective Prompt Extraction from Language Models

Language Model Inversion

Prompt Engineering a Prompt Engineer

Uncovering Hidden Intentions: Exploring Prompt Recovery for Deeper Insights into Generated Texts

A Brief History of Prompt: Leveraging Language Models. (Through Advanced Prompting)

Prompting Hard or Hardly Prompting: Prompt Inversion for Text-to-Image Diffusion Models

Advancing Prompt Recovery in NLP: A Deep Dive into the Integration of Gemma-2b-it and Phi2 Models

PRSA: PRompt Stealing Attacks against Large Language Models

Prompt Recovery for Image Generation Models: A Comparative Study of Discrete Optimizers

Automated Black-box Prompt Engineering for Personalized Text-to-Image Generation

Controllable Generation from Pre-trained Language Models via Inverse Prompting

Intent-based Prompt Calibration: Enhancing prompt optimization with synthetic boundary cases

Automatic Prompt Optimization with "Gradient Descent" and Beam Search

PromptFix: Few-shot Backdoor Removal via Adversarial Prompt Tuning

EchoPrompt: Instructing the Model to Rephrase Queries for Improved In-context Learning

RePrompt: Planning by Automatic Prompt Engineering for Large Language Models Agents

RLPrompt: Optimizing Discrete Text Prompts with Reinforcement Learning

Prompt Space Optimizing Few-shot Reasoning Success with Large Language Models