First Heuristic Then Rational: Dynamic Use of Heuristics in Language Model Reasoning

Yoichi Aoki, Keito Kudo, Tatsuki Kuribayashi, Shusaku Sone, Masaya Taniguchi, Keisuke Sakaguchi, Kentaro Inui
2024-10-07
Abstract: Multi-step reasoning instruction, such as chain-of-thought prompting, is widely adopted to elicit better performance from language models (LMs). We report on the systematic strategy that LMs employ in such a multi-step reasoning process. Our controlled experiments reveal that LMs rely more heavily on heuristics, such as lexical overlap, in the earlier stages of reasoning, where more reasoning steps remain to reach a goal. Conversely, their reliance on heuristics decreases as LMs progress closer to the final answer through multiple reasoning steps. This suggests that LMs can track only a limited number of future steps and dynamically combine heuristic strategies with rational ones in tasks involving multi-step reasoning.
Computation and Language
What problem does this paper attempt to address?
This paper asks whether language models (LMs) follow a systematic, human-like strategy in multi-step reasoning: relying on heuristics at first and shifting to more rational reasoning later. The study finds that in the early stages of multi-step reasoning, when many steps remain before the goal, LMs lean more heavily on shallow heuristics such as lexical overlap. As they approach the final answer, this reliance decreases. The authors take this to suggest that LMs can track only a limited number of future steps and that they dynamically combine heuristic and rational strategies. The work helps explain how LMs behave during problem solving and gives LM users insight into the systems' inherent biases. Through controlled experiments, the paper also shows that models differ in how susceptible they are to heuristic biases, and it supports the claim that they switch strategies over the course of multi-step reasoning.
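To make the "lexical overlap" heuristic concrete, here is a minimal sketch of how such a shortcut could favor a distractor over a relevant premise. It is an illustration only, not the paper's experimental setup: the function names (`tokenize`, `lexical_overlap`, `pick_by_overlap`), the Jaccard-style score, and the example sentences are all assumptions introduced here.

```python
import re


def tokenize(text):
    """Lowercase the text and return its set of alphanumeric tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def lexical_overlap(question, premise):
    """Jaccard overlap between question and premise tokens (hypothetical score)."""
    q, p = tokenize(question), tokenize(premise)
    if not q or not p:
        return 0.0
    return len(q & p) / len(q | p)


def pick_by_overlap(question, premises):
    """Select the premise that shares the most vocabulary with the question."""
    return max(premises, key=lambda p: lexical_overlap(question, p))


# Hypothetical example: the distractor mentions "Judy" and "apples" but is
# irrelevant to the answer, while the relevant premise shares fewer words.
question = "How many apples does Judy have?"
distractor = "Judy's mother has 5 apples."
relevant = "Walter has 3 apples."

chosen = pick_by_overlap(question, [relevant, distractor])
print(chosen)
```

A selector driven purely by surface overlap picks the distractor here, because it shares two tokens with the question while the relevant premise shares only one. This is the kind of shortcut the paper describes LMs as relying on more heavily when many reasoning steps remain.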