ARPIC: Personal Information Recognition of ID Card with Interference of Watermark

Wenqiang Miao, Chunhong Zhang, Zheng Hu, Chenhe Zhang
DOI: https://doi.org/10.1109/iccc54389.2021.9674471
2021-01-01
Abstract: Although optical character recognition (OCR) of printed document text is well developed, it is still challenging to recognize the personal information in a copy of an ID card (CIC) when a visible watermark interferes. To address this issue, we present ARPIC, a framework for automatically recognizing the personal information in a CIC. The framework consists of four parts: an ID card region detection module (IRDM) that locates and rectifies the ID card region, a watermark removing module (WRM) that removes the visible watermark overlaid on the ID card region, a key texts locating and recognition module (KLRM) that recognizes the key texts, and a text correction module (TCM) that corrects the recognition results. By combining the data-driven sub-models WRM and KLRM with the rule-based sub-models IRDM and TCM, ARPIC is robust, effective, and flexible. In addition, a synthetic dataset is used to train WRM and KLRM and to evaluate the performance of ARPIC.
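The four-module flow described in the abstract can be pictured as a simple sequential pipeline. The sketch below is only an illustration of that ordering (IRDM, then WRM, then KLRM, then TCM); it is not the authors' implementation. The names `CicState`, `make_stage`, and `run_pipeline` are hypothetical, the stage bodies are stubs, and the whitespace-stripping rule in `tcm` is an assumed stand-in for whatever correction rules the paper actually uses.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class CicState:
    """Hypothetical container passed between stages (not from the paper)."""
    image: object                                         # scanned copy of the ID card
    fields: Dict[str, str] = field(default_factory=dict)  # recognized key texts
    trace: List[str] = field(default_factory=list)        # names of stages that ran


Stage = Callable[[CicState], CicState]


def make_stage(name: str, fn: Callable[[CicState], None]) -> Stage:
    """Wrap a stage body so that each run is recorded in the trace."""
    def stage(state: CicState) -> CicState:
        fn(state)
        state.trace.append(name)
        return state
    return stage


def irdm(state: CicState) -> None:
    # Stub: a rule-based step would locate and rectify the ID card region here.
    pass


def wrm(state: CicState) -> None:
    # Stub: a trained model would remove the visible watermark here.
    pass


def klrm(state: CicState) -> None:
    # Stub: a trained model would locate and recognize key texts here.
    # The values below are made-up placeholders, not real recognition output.
    state.fields = {"name": " Alice ", "id_number": "1234 5678"}


def tcm(state: CicState) -> None:
    # Assumed example rule: drop all whitespace from each recognized field.
    state.fields = {k: "".join(v.split()) for k, v in state.fields.items()}


ARPIC_STAGES: List[Stage] = [
    make_stage("IRDM", irdm),
    make_stage("WRM", wrm),
    make_stage("KLRM", klrm),
    make_stage("TCM", tcm),
]


def run_pipeline(state: CicState, stages: List[Stage]) -> CicState:
    """Apply each stage in order, passing the state along."""
    for stage in stages:
        state = stage(state)
    return state


result = run_pipeline(CicState(image=None), ARPIC_STAGES)
print(result.trace)
print(result.fields)
```

Keeping each module behind the same call signature is one way to reflect the flexibility the abstract claims: a data-driven stage such as WRM could be swapped for a different model without touching the rule-based stages around it.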