Enhancing Logical Reasoning in Large Language Models through Graph-based Synthetic Data

Jiaming Zhou,Abbas Ghaddar,Ge Zhang,Liheng Ma,Yaochen Hu,Soumyasundar Pal,Mark Coates,Bin Wang,Yingxue Zhang,Jianye Hao
2024-09-19
Abstract:Despite recent advances in training and prompting strategies for Large Language Models (LLMs), these models continue to face challenges with complex logical reasoning tasks that involve long reasoning chains. In this work, we explore the potential and limitations of using graph-based synthetic reasoning data as training signals to enhance LLMs' reasoning capabilities. Our extensive experiments, conducted on two established natural language reasoning tasks -- inductive reasoning and spatial reasoning -- demonstrate that supervised fine-tuning (SFT) with synthetic graph-based reasoning data effectively enhances LLMs' reasoning performance without compromising their effectiveness on other standard evaluation benchmarks.
Computation and Language,Machine Learning
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the poor performance of large - language models (LLMs) in complex logical reasoning tasks, especially those involving long reasoning chains. Although progress has been made in training and prompting strategies, these models are still difficult to handle multi - step reasoning tasks and are very sensitive to minor perturbations in input prompts. The paper aims to overcome these challenges by exploring graph - based synthetic reasoning data as a training signal to enhance the reasoning ability of LLMs. Specifically, the authors propose a random walk sampling algorithm to generate synthetic graph - structured data and introduce a new prompting strategy (ETA - P, i.e., "Extract Then Answer") to improve the reasoning performance of LLMs on specific tasks. Through extensive experiments on two established natural - language reasoning tasks - inductive reasoning and spatial reasoning, the research has proven that supervised fine - tuning (SFT) using synthetic graph - based reasoning data can effectively improve the reasoning performance of LLMs without affecting their performance on other standard evaluation benchmarks.