Integrating Coordinates with Context for Information Extraction in Document Images

Zhaohui Jiang, Zheng Huang, Yunrui Lian, Jie Guo, Weidong Qiu
DOI: https://doi.org/10.1109/icdar.2019.00065
2019-01-01
Abstract: Information extraction from document collections is a fundamental step in understanding, structuring, and analyzing data. Many rule-based and deep-learning-based approaches have been applied to plain text; for document images, the same need exists, but extraction becomes considerably more challenging without linguistic knowledge. In this paper, we propose an approach to extract required named entities (NEs) from document images by integrating the coordinate information from the detection and recognition stage into the contextual information of the BiLSTM-CRF model with an attention mechanism. We test this method on two real-world datasets: a Contract Dataset of Listed Companies and our own Insurance Policy Dataset. The results show that combining coordinates and context with attention improves extraction from document images, opening up potential applications for such tasks.
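The abstract does not say how the coordinates are encoded or where they enter the model, so the following is only an illustrative sketch of one plausible input-construction step, not the authors' method. It assumes each recognized text segment comes with a pixel bounding box and a precomputed text embedding, normalizes the box to the page size, and concatenates it to the embedding before the result would be passed to a sequence tagger such as a BiLSTM-CRF. The names `TextBox`, `normalized_coords`, and `build_features` are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Sequence


@dataclass
class TextBox:
    """A recognized text segment and its bounding box in pixels (hypothetical structure)."""
    text: str
    x_min: float
    y_min: float
    x_max: float
    y_max: float


def normalized_coords(box: TextBox, page_width: float, page_height: float) -> List[float]:
    """Scale the box corners to [0, 1] so features are comparable across page sizes."""
    return [
        box.x_min / page_width,
        box.y_min / page_height,
        box.x_max / page_width,
        box.y_max / page_height,
    ]


def build_features(
    boxes: Sequence[TextBox],
    embeddings: Sequence[Sequence[float]],
    page_width: float,
    page_height: float,
) -> List[List[float]]:
    """Concatenate each segment's text embedding with its normalized coordinates.

    Assumption: simple concatenation is used here for illustration; the paper
    may combine coordinates and context differently (for example via attention).
    """
    if len(boxes) != len(embeddings):
        raise ValueError("boxes and embeddings must have the same length")
    return [
        list(emb) + normalized_coords(box, page_width, page_height)
        for box, emb in zip(boxes, embeddings)
    ]
```

In this sketch, a segment with a 2-dimensional embedding yields a 6-dimensional feature vector: the two embedding values followed by the four normalized corner coordinates.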