ChatGLM-Math: Improving Math Problem-Solving in Large Language Models with a Self-Critique Pipeline

Yifan Xu, Xiao Liu, Xinghan Liu, Zhenyu Hou, Yueyan Li, Xiaohan Zhang, Zihan Wang, Aohan Zeng, Zhengxiao Du, Wenyi Zhao, Jie Tang, Yuxiao Dong
2024-04-04
Abstract: Large language models (LLMs) have shown excellent mastering of human language, but still struggle in real-world applications that require mathematical problem-solving. While many strategies and datasets to enhance LLMs' mathematics are developed, it remains a challenge to simultaneously maintain and improve both language and mathematical capabilities in deployed LLM
Computation and Language
What problem does this paper attempt to address?
This paper addresses the weakness of large language models (LLMs) in mathematical problem-solving. Although these models handle human language well, they struggle in real-world applications that require mathematical reasoning, and improving their mathematics often comes at the expense of their language ability. The paper proposes a Self-Critique pipeline that uses feedback learning to improve both capabilities together. The pipeline has two stages: rejection fine-tuning (called Rejective Fine-tuning, RFT, in the paper) followed by Direct Preference Optimization (DPO). Together they strengthen mathematical skills while preserving language proficiency. In the reported experiments, the method significantly improves the mathematical problem-solving ability of ChatGLM3-32B, and the resulting model performs strongly on MathUserEval, a challenging benchmark newly created for the paper.
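The data-selection logic behind the two stages can be sketched as follows. This is a minimal illustration and not the authors' implementation: it assumes the critique signal is a single numeric score per response, and the names `score_candidates`, `select_for_rft`, `build_dpo_pair`, and `toy_critic`, together with the score scale and thresholds, are hypothetical. Rejection fine-tuning keeps only sampled responses that the critic scores highly. DPO then needs a preferred and a dispreferred response for the same question.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional, Tuple


@dataclass
class Scored:
    response: str
    score: float


def score_candidates(
    question: str,
    candidates: List[str],
    critic: Callable[[str, str], float],
) -> List[Scored]:
    """Attach a critique score to every sampled response."""
    return [Scored(c, critic(question, c)) for c in candidates]


def select_for_rft(scored: List[Scored], threshold: float) -> List[str]:
    """Stage 1 (rejection fine-tuning): keep responses at or above the threshold."""
    return [s.response for s in scored if s.score >= threshold]


def build_dpo_pair(
    scored: List[Scored], min_gap: float
) -> Optional[Tuple[str, str]]:
    """Stage 2 (DPO): return (chosen, rejected) if the score gap is large enough."""
    if len(scored) < 2:
        return None
    best = max(scored, key=lambda s: s.score)
    worst = min(scored, key=lambda s: s.score)
    if best.score - worst.score < min_gap:
        return None  # too similar to carry a useful preference signal
    return best.response, worst.response


def toy_critic(question: str, response: str) -> float:
    """Hypothetical stand-in for a learned critique model (assumption)."""
    return 10.0 if response.strip().endswith("4") else 2.0


if __name__ == "__main__":
    question = "What is 2 + 2?"
    samples = ["2 + 2 = 4", "2 + 2 = 5", "The answer is 4"]
    scored = score_candidates(question, samples, toy_critic)
    print(select_for_rft(scored, threshold=8.0))
    print(build_dpo_pair(scored, min_gap=5.0))
```

In a real pipeline the kept responses would form the supervised fine-tuning set and the pairs would feed a DPO trainer; here the critic is a toy rule so that the selection steps can be run in isolation.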