Self-Polish: Enhance Reasoning in Large Language Models Via Problem Refinement.

Zhiheng Xi,Senjie Jin,Yuhao Zhou,Rui Zheng,Songyang Gao,Jia Liu,Tao Gui,Qi Zhang,Xuanjing Huang
DOI: https://doi.org/10.18653/v1/2023.findings-emnlp.762
2023-01-01
Abstract:To enhance the multi-step reasoning capabilities of large language models,researchers have extensively explored prompting methods, notably theChain-of-Thought (CoT) method which explicitly elicits human-like rationales.However, they have inadvertently overlooked the potential of enhancing modelreasoning performance by formulating higher-quality problems. In this work, westart from the problem side and propose Self-Polish (SP), a novel method thatfacilitates the model's reasoning by guiding it to progressively refine thegiven problems to be more comprehensible and solvable. We also explore severalautomatic prompting varients and propose the Self-Polish prompt bank for thecommunity. SP is orthogonal to all other prompting methods of answer/reasoningside like CoT, allowing for seamless integration with state-of-the-arttechniques for further improvement. Thorough experiments show that the proposedmethod attains notable and consistent effectiveness on five reasoningbenchmarks across different models. Furthermore, our method also showcasesimpressive performance on robustness evaluation. Codes and prompts areavailable at https://github.com/WooooDyy/Self-Polish.
What problem does this paper attempt to address?