Abstract:With the introduction of large language models (LLMs), automatic math reasoning has seen tremendous success. However, current methods primarily focus on providing solutions or using techniques like Chain-of-Thought to enhance problem-solving accuracy. In this paper, we focus on improving the capability of mathematics teaching via a Socratic teaching-based LLM (\texttt{SocraticLLM}), which guides learners toward profound thinking with clarity and self-discovery via conversation. We collect and release a high-quality mathematical teaching dataset, named \texttt{SocraticMATH}, which provides Socratic-style conversations of problems with extra knowledge. Also, we propose a knowledge-enhanced LLM as a strong baseline to generate reliable responses with review, guidance/heuristic, rectification, and summarization. Experimental results show the great advantages of \texttt{SocraticLLM} by comparing it with several strong generative models. The codes and datasets are available on \url{<a class="link-external link-https" href="https://github.com/ECNU-ICALK/SocraticMath" rel="external noopener nofollow">this https URL</a>}.

What problem does this paper attempt to address?

The problem this paper attempts to address is how to enhance the effectiveness of mathematics teaching by combining the Socratic method with large language models (LLMs). Specifically, existing mathematical LLMs mainly focus on improving mathematical problem-solving abilities, but they have limitations in teaching applications. For example, when faced with mathematical problems, LLMs tend to directly provide the solution process, even when playing the role of a teacher. This leads to a lack of independent thinking and discovery process for students. Therefore, this study aims to design an LLM based on the Socratic method (referred to as SocraticLLM) to guide students to achieve deep thinking and self-discovery through dialogue, thereby improving the quality of mathematics teaching. To achieve this goal, the researchers proposed several key innovations: 1. **Knowledge-enhanced SocraticLLM**: As a powerful baseline model, SocraticLLM generates reliable responses by incorporating additional knowledge, including teaching skills such as review, inspiration, correction, and summarization. 2. **Building a high-quality dataset**: The researchers created and released a high-quality mathematics teaching dataset named SocraticMATH, which includes original questions, answers, solutions, and dialogue tutoring data covering 513 knowledge points in elementary mathematics. 3. **Experimental validation**: Through a series of experiments, the researchers demonstrated the effectiveness of SocraticLLM in both automatic and manual evaluations, particularly excelling in reducing hallucinations (i.e., the generation of incorrect information by the model). In summary, the core issue of this paper is to explore how to use the Socratic method to improve the application of LLMs in mathematics teaching, in order to promote students' independent learning and deep understanding.

Boosting Large Language Models with Socratic Method for Conversational Mathematics Teaching

Large Language Models for Mathematical Reasoning: Progresses and Challenges

SIaM: Self-Improving Code-Assisted Mathematical Reasoning of Large Language Models

MathChat: Benchmarking Mathematical Reasoning and Instruction Following in Multi-Turn Interactions

ChatGLM-Math: Improving Math Problem-Solving in Large Language Models with a Self-Critique Pipeline

MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models

Mathify: Evaluating Large Language Models on Mathematical Problem Solving Tasks

LLM Reasoning Engine: Specialized Training for Enhanced Mathematical Reasoning

MathLearner: A Large Language Model Agent Framework for Learning to Solve Mathematical Problems

MuMath-Code: Combining Tool-Use Large Language Models with Multi-perspective Data Augmentation for Mathematical Reasoning

MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models

Skywork-Math: Data Scaling Laws for Mathematical Reasoning in Large Language Models -- The Story Goes On

Do Large Language Models Truly Grasp Mathematics? An Empirical Exploration From A Psychological Perspective

MathChat: Converse to Tackle Challenging Math Problems with LLM Agents

Three Questions Concerning the Use of Large Language Models to Facilitate Mathematics Learning

A Survey of Mathematical Reasoning in the Era of Multimodal Large Language Model: Benchmark, Method & Challenges

Teaching-Inspired Integrated Prompting Framework: A Novel Approach for Enhancing Reasoning in Large Language Models

Solving Math Word Problems by Combining Language Models With Symbolic Solvers

Novice Learner and Expert Tutor: Evaluating Math Reasoning Abilities of Large Language Models with Misconceptions