SkyMath: Technical Report

Liu Yang,Haihua Yang,Wenjun Cheng,Lei Lin,Chenxia Li,Yifu Chen,Lunan Liu,Jianfei Pan,Tianwen Wei,Biye Li,Liang Zhao,Lijie Wang,Bo Zhu,Guoliang Li,Xuejie Wu,Xilin Luo,Rui Hu
DOI: https://doi.org/10.48550/arXiv.2310.16713
2023-10-26
Abstract:Large language models (LLMs) have shown great potential to solve varieties of natural language processing (NLP) tasks, including mathematical reasoning. In this work, we present SkyMath, a large language model for mathematics with 13 billion parameters. By applying self-compare fine-tuning, we have enhanced mathematical reasoning abilities of Skywork-13B-Base remarkably. On GSM8K, SkyMath outperforms all known open-source models of similar size and has established a new SOTA performance.
Computation and Language,Artificial Intelligence
What problem does this paper attempt to address?