Teaching-Inspired Integrated Prompting Framework: A Novel Approach for Enhancing Reasoning in Large Language Models

Wenting Tan, Dongxiao Chen, Jieting Xue, Zihao Wang, Taijie Chen

2024-10-11

Abstract: Large Language Models (LLMs) exhibit impressive performance across various domains but still struggle with arithmetic reasoning tasks. Recent work shows the effectiveness of prompt design methods in enhancing reasoning capabilities. However, these approaches overlook the prior knowledge of specific concepts, theorems, and tricks that is needed to solve most arithmetic reasoning problems. To address this issue, we propose a novel and effective Teaching-Inspired Integrated Framework, which emulates the instructional process of a teacher guiding students. This method equips LLMs with essential concepts, relevant theorems, and similar problems with analogous solution approaches, thereby enhancing their reasoning abilities. Additionally, we introduce two new Chinese datasets, MathMC and MathToF, both with detailed explanations and answers. Experiments on nine benchmarks demonstrate that our approach improves the reasoning accuracy of LLMs. With GPT-4 and our framework, we achieve new state-of-the-art performance on four math benchmarks (AddSub, SVAMP, Math23K and AQuA) with accuracies of 98.2% (+3.3%), 93.9% (+0.2%), 94.3% (+7.2%) and 81.1% (+1.2%), respectively. Our data and code are available at https://github.com/SallyTan13/Teaching-Inspired-Prompting.

Computation and Language, Artificial Intelligence

What problem does this paper attempt to address?

The paper addresses the weak performance of large language models (LLMs) on arithmetic reasoning tasks. Although LLMs perform well across many natural language processing (NLP) tasks, they still struggle with tasks that require complex reasoning. The paper proposes the Teaching-Inspired Integrated Prompting Framework, which mimics the way a teacher guides students: it supplies the model with essential concepts, relevant theorems, and similar problems with analogous solution approaches. The paper also introduces two new Chinese datasets, MathMC and MathToF, both with detailed explanations and answers. Experiments on nine benchmarks show that the method improves the reasoning accuracy of LLMs; with GPT-4, it achieves new state-of-the-art results on four math benchmarks: AddSub (98.2%), SVAMP (93.9%), Math23K (94.3%), and AQuA (81.1%).
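To make the idea concrete, here is a minimal sketch of how background knowledge could be placed in front of a target problem in a single prompt. It is not the authors' implementation. The names `TeachingContext` and `build_teaching_prompt`, the section labels, and the example problems are all hypothetical and chosen only for illustration. How the concepts, theorems, and similar problems are retrieved is outside the scope of this sketch.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class TeachingContext:
    """Hypothetical container for the background a teacher might supply."""
    concepts: List[str] = field(default_factory=list)
    theorems: List[str] = field(default_factory=list)
    # Each similar problem is a (question, worked_solution) pair.
    similar_problems: List[Tuple[str, str]] = field(default_factory=list)


def build_teaching_prompt(question: str, ctx: TeachingContext) -> str:
    """Assemble one prompt string: background first, target question last.

    Empty sections are skipped so the prompt contains no blank headers.
    """
    sections = []
    if ctx.concepts:
        sections.append(
            "Relevant concepts:\n" + "\n".join(f"- {c}" for c in ctx.concepts)
        )
    if ctx.theorems:
        sections.append(
            "Relevant theorems:\n" + "\n".join(f"- {t}" for t in ctx.theorems)
        )
    for i, (q, solution) in enumerate(ctx.similar_problems, start=1):
        sections.append(f"Similar problem {i}:\n{q}\nSolution: {solution}")
    sections.append("Now solve the following problem step by step:\n" + question)
    return "\n\n".join(sections)


# Illustrative inputs only; these problems are not taken from the paper's datasets.
ctx = TeachingContext(
    concepts=["Speed = distance / time"],
    similar_problems=[
        ("A car travels 120 km in 2 hours. What is its speed?",
         "120 / 2 = 60 km/h"),
    ],
)
prompt = build_teaching_prompt(
    "A train travels 300 km in 4 hours. What is its speed?", ctx
)
print(prompt)
```

The resulting string would then be sent to whichever LLM is under evaluation. Putting the target question last is a design choice of this sketch, so that the model reads the concepts and the worked example before the problem it has to solve.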

From Good to Great: Improving Math Reasoning with Tool-Augmented Interleaf Prompting

Active Prompting with Chain-of-Thought for Large Language Models

Prompt Space Optimizing Few-shot Reasoning Success with Large Language Models

MathPrompter: Mathematical Reasoning using Large Language Models

Multi-tool Integration Application for Math Reasoning Using Large Language Model

Evaluating Mathematical Reasoning of Large Language Models: A Focus on Error Identification and Correction

Look Before You Leap: Problem Elaboration Prompting Improves Mathematical Reasoning in Large Language Models

Progressive-Hint Prompting Improves Reasoning in Large Language Models

Benchmarking Large Language Models for Math Reasoning Tasks

Expanding Search Space with Diverse Prompting Agents: An Efficient Sampling Approach for LLM Mathematical Reasoning

Logic-of-Thought: Injecting Logic into Contexts for Full Reasoning in Large Language Models

Logic Contrastive Reasoning with Lightweight Large Language Model for Math Word Problems

DOP: Diagnostic-Oriented Prompting for Large Language Models in Mathematical Correction

TPD: Enhancing Student Language Model Reasoning via Principle Discovery and Guidance

StrategyLLM: Large Language Models as Strategy Generators, Executors, Optimizers, and Evaluators for Problem Solving

SuperCorrect: Supervising and Correcting Language Models with Error-Driven Insights

Let's Be Self-generated via Step by Step: A Curriculum Learning Approach to Automated Reasoning with Large Language Models

Reasoning with Large Language Models, a Survey

INC-Math: Integrating Natural Language and Code for Enhanced Mathematical Reasoning in Large Language Models

Boosting Large Language Models with Socratic Method for Conversational Mathematics Teaching