Discriminative estimation of probabilistic context-free grammars for mathematical expression recognition and retrieval

Ernesto Noya, José Miguel Benedí, Joan Andreu Sánchez, Dan Anitei
DOI: https://doi.org/10.1007/s10044-023-01158-8
IF: 2.307
2023-04-18
Pattern Analysis and Applications
Abstract: We present a discriminative learning algorithm for the probabilistic estimation of two-dimensional probabilistic context-free grammars (2D-PCFG) for mathematical expression recognition and retrieval. The algorithm uses a generalization of the H-criterion as the objective function and growth transformations as the optimization method. The discriminative estimation algorithm is developed from the N-best interpretations provided by the 2D-PCFG. Experimental results are reported on two available datasets: Im2Latex and IBEM. The first experiment compares the proposed discriminative estimation method with the classic Viterbi-based estimation method. The second studies how the performance of the estimated models depends on the length of the mathematical expressions and on the number of admissible errors in the metric used.
computer science, artificial intelligence