A bottom-up OCR system for mathematical formulas recognition

Wei Wu,Feng Li,Jun Kong,Lichang Hou,Bingdui Zhu
DOI: https://doi.org/10.1007/11816157_27
2006-01-01
Abstract:An OCR system is presented to understand mathematical formulas in binary printed document images. The system utilizes a novel component-labeling algorithm for extracting local maximum components from image, and uses these components to locate the mathematical formulas. A character recognition algorithm based on neural networks is then adopted. For segmenting merged characters in the image, a novel segmentation algorithm based on a modified SOM neural network was introduced into the system. With the employment of LL(1) grammar, this system can convert the recognition results into a $\mbox{\LaTeX}$ file.
What problem does this paper attempt to address?