Abstract: With the widespread use of large language models (LLMs) in NLP tasks, researchers have discovered the potential of Chain-of-Thought (CoT) to assist LLMs in accomplishing complex reasoning tasks by generating intermediate steps. However, human thought processes are often non-linear rather than simple sequential chains of thoughts. Therefore, we propose Graph-of-Thought (GoT) reasoning, which models human thought processes not only as a chain but also as a graph. By representing thought units as nodes and the connections between them as edges, our approach captures the non-sequential nature of human thinking and allows for more realistic modeling of thought processes. Similar to Multimodal-CoT, we model GoT reasoning as a two-stage framework that generates rationales first and then produces the final answer. Specifically, we employ an additional graph-of-thoughts encoder for GoT representation learning and fuse the GoT representation with the original input representation through a gated fusion mechanism. We implement a GoT reasoning model on the T5 pre-trained model and evaluate its performance on a text-only reasoning task (GSM8K) and a multimodal reasoning task (ScienceQA). Our model achieves significant improvements over the strong CoT baseline of 3.41% and 5.08% on the GSM8K test set with the T5-base and T5-large architectures, respectively. Additionally, our model boosts accuracy over the state-of-the-art Multimodal-CoT on the ScienceQA test set from 84.91% to 91.54% using the T5-base model and from 91.68% to 92.77% using the T5-large model. Experiments show that GoT, with fewer than 250M backbone model parameters, achieves results comparable to Multimodal-CoT (large), which has over 700M parameters, demonstrating the effectiveness of GoT.
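
The gated fusion step mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the formula, so everything here is an assumption: a generic element-wise sigmoid gate that blends a text vector with a graph vector of the same size. The names `gated_fusion`, `w_text`, and `w_graph` are hypothetical and are not taken from the authors' implementation.

```python
import math
import random


def sigmoid(x: float) -> float:
    return 1.0 / (1.0 + math.exp(-x))


def matvec(w, v):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w_ij * v_j for w_ij, v_j in zip(row, v)) for row in w]


def gated_fusion(h_text, h_graph, w_text, w_graph):
    """Blend two same-sized vectors with an element-wise gate.

    Assumed form (not confirmed by the source):
        gate  = sigmoid(w_text @ h_text + w_graph @ h_graph)
        fused = (1 - gate) * h_text + gate * h_graph
    """
    a = matvec(w_text, h_text)
    b = matvec(w_graph, h_graph)
    gate = [sigmoid(x + y) for x, y in zip(a, b)]
    fused = [(1 - g) * t + g * k for g, t, k in zip(gate, h_text, h_graph)]
    return fused, gate


# Toy usage with random vectors and small random weights.
rng = random.Random(0)
d = 4
h_text = [rng.uniform(-1, 1) for _ in range(d)]
h_graph = [rng.uniform(-1, 1) for _ in range(d)]
w_text = [[rng.gauss(0, 0.1) for _ in range(d)] for _ in range(d)]
w_graph = [[rng.gauss(0, 0.1) for _ in range(d)] for _ in range(d)]

fused, gate = gated_fusion(h_text, h_graph, w_text, w_graph)
```

Because each gate value lies strictly between 0 and 1, every fused coordinate lies between the corresponding text and graph coordinates. In a trained model the weights would be learned, letting the network decide per dimension how much graph information to admit.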

ToW: Thoughts of Words Improve Reasoning in Large Language Models

Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Improve Vision Language Model Chain-of-thought Reasoning

Enhancing Zero-Shot Chain-of-Thought Reasoning in Large Language Models through Logic

Beyond Chain-of-Thought, Effective Graph-of-Thought Reasoning in Language Models

Beyond Chain-of-Thought, Effective Graph-of-Thought Reasoning in Large Language Models

Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language Models

Whiteboard-of-Thought: Thinking Step-by-Step Across Modalities

TimeToM: Temporal Space is the Key to Unlocking the Door of Large Language Models' Theory-of-Mind

Forest-of-Thought: Scaling Test-Time Compute for Enhancing LLM Reasoning

Visual Chain of Thought: Bridging Logical Gaps with Multimodal Infillings

Think from Words(TFW): Initiating Human-Like Cognition in Large Language Models Through Think from Words for Japanese Text-level Classification

Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models

Graph of Thoughts: Solving Elaborate Problems with Large Language Models

Everything of Thoughts: Defying the Law of Penrose Triangle for Thought Generation

Constrained Reasoning Chains for Enhancing Theory-of-Mind in Large Language Models

Think Twice: Perspective-Taking Improves Large Language Models' Theory-of-Mind Capabilities

Logic-of-Thought: Injecting Logic into Contexts for Full Reasoning in Large Language Models

Fine-Tuning with Divergent Chains of Thought Boosts Reasoning Through Self-Correction in Language Models

Understanding When Tree of Thoughts Succeeds: Larger Models Excel in Generation, Not Discrimination