Robust Prompt Optimization for Large Language Models Against Distribution Shifts

Moxin Li,Wenjie Wang,Fuli Feng,Yixin Cao,Jizhi Zhang,Tat-Seng Chua
DOI: https://doi.org/10.18653/v1/2023.emnlp-main.95
2023-01-01
Abstract:Large Language Model (LLM) has demonstrated significant ability in variousNatural Language Processing tasks. However, their effectiveness is highlydependent on the phrasing of the task prompt, leading to research on automaticprompt optimization using labeled task data. We reveal that these promptoptimization techniques are vulnerable to distribution shifts such assubpopulation shifts, which are common for LLMs in real-world scenarios such ascustomer reviews analysis. In this light, we propose a new problem of robustprompt optimization for LLMs against distribution shifts, which requires theprompt optimized over the labeled source group can simultaneously generalize toan unlabeled target group. To solve this problem, we propose Generalized PromptOptimization framework, which incorporates the unlabeled data from the targetgroup into prompt optimization. Extensive experimental results demonstrate theeffectiveness of the proposed framework with significant performanceimprovement on the target group and comparable performance on the source group.
What problem does this paper attempt to address?