Automatic extraction of structured information from elementary level geometry questions into logic forms
Archana Boob, Shiva Reddy, Deep Walke, Harshini Pillarisetti, Shreeya Shukla, Mansi Radke
DOI: https://doi.org/10.1007/s11042-024-20463-w
IF: 2.577
2024-11-29
Multimedia Tools and Applications
Abstract: Geometry has a plethora of practical applications, as real-world situations are often modeled as geometry math word problems (MWPs). Existing approaches to solving MWPs automatically rely on manually generated or regular-expression-based techniques to reach their intermediate form, which makes them less scalable and generalizable. The information embedded in a geometry question needs to be converted to a formal, machine-understandable form before standard algorithms can process it. To address this challenge, we propose two pipelines: one extracts information from the diagram using image processing techniques, and the other extracts information from the text using Natural Language Processing techniques. A formal language consisting of 'logic forms' is proposed to represent this information in a structured way. The logic forms generated from the text and from the diagram of a question are then combined by a union operation. To test the proposed approach, we create a dataset, ElementaryGeometryQA, containing 500+ questions from standard elementary school-level Indian textbooks. To evaluate the generated logic forms, skilled domain experts create ground truth logic forms for each question. We also devise a unique method to evaluate the generated logic forms with the Metric for Evaluation of Translation with Explicit ORdering (METEOR). The approach obtains a METEOR score of 0.54 on the ElementaryGeometryQA dataset. We additionally evaluate it on an existing dataset, Geometry3K, on which it obtains a METEOR score of 0.49.
computer science, information systems, theory & methods, engineering, electrical & electronic, software engineering
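
The union step mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not specify the logic form syntax or how duplicates are detected, so everything here is an assumption made for illustration: the predicate names (`Equals`, `LengthOf`, `Perpendicular`, `Triangle`, `Line`), the string representation, the whitespace-based normalization, and the helper names `normalize` and `union_logic_forms` are all hypothetical and are not taken from the paper.

```python
from typing import Iterable, List


def normalize(form: str) -> str:
    """Strip all whitespace so spacing differences do not create duplicates.

    Assumption: two logic forms are the same if they match after
    whitespace removal. The paper may use a different equivalence.
    """
    return "".join(form.split())


def union_logic_forms(text_forms: Iterable[str],
                      diagram_forms: Iterable[str]) -> List[str]:
    """Return the order-preserving union of two lists of logic forms."""
    seen = set()
    merged = []
    for form in list(text_forms) + list(diagram_forms):
        key = normalize(form)
        if key not in seen:
            seen.add(key)
            merged.append(key)
    return merged


# Hypothetical logic forms from the two pipelines for one question.
text_forms = [
    "Equals(LengthOf(Line(A, B)), 5)",
    "Perpendicular(Line(A, B), Line(B, C))",
]
diagram_forms = [
    "Perpendicular(Line(A,B),Line(B,C))",  # same fact, different spacing
    "Triangle(A, B, C)",
]

merged = union_logic_forms(text_forms, diagram_forms)
for form in merged:
    print(form)
```

In this sketch the perpendicularity fact appears in both sources and is kept once, so the merged list holds three logic forms. A real system would probably need a stronger notion of equivalence, for example treating `Line(A, B)` and `Line(B, A)` as the same entity.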