MRD: A Memory Relation Decoder for Online Handwritten Mathematical Expression Recognition

Jiaming Wang,Qing Wang,Jun Du,Jianshu Zhang,Bin Wang,Bo Ren
DOI: https://doi.org/10.1007/978-3-030-86334-0_3
2021-01-01
Abstract:Recently, attention based encoder-decoder methods have been widely used in online handwritten mathematical expression recognition, which achieve significant improvements compared to traditional methods. The encoder-decoder methods usually employ string decoders to generate the recognition result, which are not well matched for tree-structured languages like math expression. A novel sequential relation decoder (SRD) was introduced to recognize the online mathematical expression as a math tree, which can be decomposed into a subtree sequence and each subtree consists of a relation node and two symbol nodes (related symbol node and primary symbol node). However, the alignments between these two symbol nodes were implemented by spatial attention probabilities, leading to incorrect recognition if spatial attention is not accurate. In this paper, we propose a memory relation decoder (MRD), equipped with a memory based attention model to determine the correspondence between two symbol nodes. Specifically, at each decoding step, this memory based attention finds the corresponding primary symbol node in the memory and treats it as the related symbol node, which actually achieves the alignments between two symbol nodes in an explicit manner. Besides, we propose to introduce global visual information while calculating attention probabilities to help alleviate the ambiguous problems in online handwritten mathematical expression recognition. Evaluated on a benchmark published by CROHME competition, the proposed approach can substantially outperform previous encoder-decoder methods.
What problem does this paper attempt to address?