Document Page Structure Learning For Fixed-Layout E-Books Using Conditional Random Fields

Xin Tao,Zhi Tang,Canhui Xu
DOI: https://doi.org/10.1117/12.2039492
2014-01-01
Abstract:In this paper, a model is proposed to learn logical structure of fixed-layout document pages by combining support vector machine (SVM) and conditional random fields (CRF). Features related to each logical label and their dependencies are extracted from various original Portable Document Format (PDF) attributes. Both local evidence and contextual dependencies are integrated in the proposed model so as to achieve better logical labeling performance. With the merits of SVM as local discriminative classifier and CRF modeling contextual correlations of adjacent fragments, it is capable of resolving the ambiguities of semantic labels. The experimental results show that CRF based models with both tree and chain graph structures outperform the SVM model with an increase of macro-averaged F-1 by about 10%.
What problem does this paper attempt to address?