Hint of Thought prompting: an explainable and zero-shot approach to reasoning tasks with LLMs

Ioktong Lei, Zhidong Deng
2024-06-05
Abstract: As a way of communicating with users and any LLMs like GPT or PaLM2, prompting becomes an increasingly important research topic for better utilization of LLMs. Although simple prompting performs well on single-step questions, it cannot permanently activate the correct knowledge path for multi-step reasoning tasks. The chain of thought (CoT), which often contains zero-shot CoT and few-shot CoT, is a recently developed prompting method that can explain the reasoning process to the LLM and outperforms simple prompting in three challenging reasoning tasks, including arithmetic, symbolic, and commonsense reasoning. In this paper, we propose a novel hint of thought (HoT) prompting with explainability and zero-shot generalization. First, it is decomposed into the following three steps: explainable sub-questions, logical reasoning, and answer extraction. Second, such three steps are sequentially ordered in the format of step-by-step hints, which can be easily adjusted and explained to different tasks. Finally, experimental results demonstrate that our HoT prompting has a significant advantage on the zero-shot reasoning task compared to existing zero-shot CoT. We did zero-shot experiments on math tasks like GSM8K, ADDSUB, AQUA, SVAMP and commonsense tasks such as StrategyQA. In particular, the accuracy of the proposed HoT prompting is improved with GSM8K from 40.50% to 67.80%, with AQUA from 31.9% to 46.4%, with SVAMP from 63.7% to 76.9%, and with ADDSUB from 74.7% to 87.34%, respectively, which even defeats the competitive PoT approach on GSM8K, AQUA, and SVAMP.
Artificial Intelligence
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to improve the performance of large language models (LLMs) in multi-step reasoning tasks under zero-shot conditions. Specifically, although the existing simple prompting methods perform well on single-step problems, they cannot effectively activate the correct knowledge paths in tasks requiring multi-step reasoning. To overcome this limitation, the paper proposes a new prompting method, "Hint of Thought (HoT)", which is not only interpretable but also capable of effective multi-step reasoning under zero-shot conditions.

### Main contributions of the paper

1. **Proposed a new zero-shot reasoning method**: The HoT method improves the performance of LLMs in multi-step reasoning tasks by decomposing complex problems into a series of sub-problems and using pseudo-code to express the logical reasoning process.
2. **Improved the accuracy of zero-shot reasoning**: Experimental results show that the HoT method significantly outperforms the existing zero-shot Chain-of-Thought (zero-shot CoT) and other methods on multiple benchmark datasets.
3. **Provided an interpretable reasoning process**: The HoT method not only improves the accuracy of reasoning but also provides a transparent and logically clear reasoning process, making it easier for users to understand the reasoning steps of the model.

### Specific improvements

- **Sub-problem decomposition**: Decompose complex problems into multiple sub-problems, and each sub-problem requires the model to perform specific reasoning.
- **Pseudo-code logical reasoning**: Use pseudo-code to express the reasoning process of each sub-problem, avoiding the semantic ambiguity problem in natural language processing.
- **Answer extraction**: Finally, extract the required answers from the model's output and present them in the format specified by the user.
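
The three steps above (sub-problem decomposition, pseudo-code reasoning, answer extraction) can be sketched as a small prompt builder plus a parser. This is only an illustrative sketch: the hint wording in `HOT_HINTS`, the `Answer:` marker, and the function names `build_hot_prompt` and `extract_answer` are assumptions made for this example, not the prompt text or code used in the paper, and no model is actually called.

```python
import re
from typing import Optional

# Hypothetical hint wording; the paper's exact prompt is not reproduced here.
HOT_HINTS = (
    "Hint 1: Break the question into numbered sub-questions.\n"
    "Hint 2: For each sub-question, write one line of pseudo-code "
    "that computes its result.\n"
    "Hint 3: End with a line of the form 'Answer: <value>'."
)


def build_hot_prompt(question: str) -> str:
    """Assemble a zero-shot prompt: the question followed by ordered hints."""
    return f"Question: {question.strip()}\n{HOT_HINTS}"


def extract_answer(model_output: str) -> Optional[str]:
    """Return the text after the last 'Answer:' marker, or None if absent."""
    matches = re.findall(r"Answer:\s*(.+)", model_output)
    return matches[-1].strip() if matches else None


# A hand-written stand-in for a model response (not real model output).
sample_output = (
    "Sub-question 1: How many apples are left after eating 2 of 5?\n"
    "left = 5 - 2\n"
    "Answer: 3\n"
)

prompt = build_hot_prompt("Tom has 5 apples and eats 2. How many are left?")
answer = extract_answer(sample_output)
```

Taking the last `Answer:` line is a design choice for this sketch: if a response restates intermediate answers, the final one is the one most likely intended as the result.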
### Experimental results

- **Mathematical reasoning tasks**: On the GSM8K, AQUA, SVAMP, and ADDSUB datasets, the HoT method increased the accuracy from 40.50% to 67.80%, from 31.9% to 46.4%, from 63.7% to 76.9%, and from 74.7% to 87.34%, respectively.
- **Commonsense reasoning tasks**: On the StrategyQA dataset, the HoT method increased the accuracy from 52.3% to 82.96%.

### Conclusion

By proposing the HoT method, the paper addresses the problem of how to improve the performance of LLMs in multi-step reasoning tasks under zero-shot conditions. Experimental results show that the HoT method not only improves the accuracy of reasoning but also provides a transparent and logically clear reasoning process, offering a new direction for future research.
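
For a quick comparison, the accuracy figures reported in the experimental results can be restated as absolute gains in percentage points. The sketch below only does arithmetic on the numbers quoted in this summary. The names `reported` and `gain_points` are illustrative, and treating the "from" value as the zero-shot CoT baseline is an assumption based on the abstract's wording.

```python
# Accuracies (%) quoted in this summary, stored as (baseline, HoT).
# Assumption: the baseline is zero-shot CoT, as the abstract implies.
reported = {
    "GSM8K": (40.50, 67.80),
    "AQUA": (31.9, 46.4),
    "SVAMP": (63.7, 76.9),
    "ADDSUB": (74.7, 87.34),
    "StrategyQA": (52.3, 82.96),
}


def gain_points(before: float, after: float) -> float:
    """Absolute improvement in percentage points, rounded to 2 decimals."""
    return round(after - before, 2)


gains = {name: gain_points(b, a) for name, (b, a) in reported.items()}

for name, gain in gains.items():
    print(f"{name}: +{gain:.2f} points")
```

On these figures, the largest absolute gains are on StrategyQA and GSM8K (about 30.7 and 27.3 points), and the smallest is on ADDSUB (about 12.6 points).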