FormalGeo: An Extensible Formalized Framework for Olympiad Geometric Problem Solving

Xiaokai Zhang,Na Zhu,Yiming He,Jia Zou,Qike Huang,Xiaoxiao Jin,Yanjun Guo,Chenyang Mao,Yang Li,Zhe Zhu,Dengfeng Yue,Fangzhen Zhu,Yifan Wang,Yiwen Huang,Runan Wang,Cheng Qin,Zhenbing Zeng,Shaorong Xie,Xiangfeng Luo,Tuo Leng
2024-02-15
Abstract:This is the first paper in a series of work we have accomplished over the past three years. In this paper, we have constructed a consistent formal plane geometry system. This will serve as a crucial bridge between IMO-level plane geometry challenges and readable AI automated reasoning. Within this formal framework, we have been able to seamlessly integrate modern AI models with our formal system. AI is now capable of providing deductive reasoning solutions to IMO-level plane geometry problems, just like handling other natural languages, and these proofs are readable, traceable, and verifiable. We propose the geometry formalization theory (GFT) to guide the development of the geometry formal system. Based on the GFT, we have established the FormalGeo, which consists of 88 geometric predicates and 196 theorems. It can represent, validate, and solve IMO-level geometry problems. we also have crafted the FGPS (formal geometry problem solver) in Python. It serves as both an interactive assistant for verifying problem-solving processes and an automated problem solver. We've annotated the formalgeo7k and formalgeo-imo datasets. The former contains 6,981 (expand to 133,818 through data augmentation) geometry problems, while the latter includes 18 (expand to 2,627 and continuously increasing) IMO-level challenging geometry problems. All annotated problems include detailed formal language descriptions and solutions. Implementation of the formal system and experiments validate the correctness and utility of the GFT. The backward depth-first search method only yields a 2.42% problem-solving failure rate, and we can incorporate deep learning techniques to achieve lower one. The source code of FGPS and datasets are available at <a class="link-external link-https" href="https://github.com/BitSecret/FGPS" rel="external noopener nofollow">this https URL</a>.
Artificial Intelligence
What problem does this paper attempt to address?