Abstract: With the ongoing popularization of online services, digital document images are used in a wide range of applications. Meanwhile, deep learning-based text editing algorithms that alter the textual information of an image have emerged. In this work, we present a document forgery algorithm for editing practical document images. To achieve this goal, we address the limitations of existing text editing algorithms with respect to complicated characters and complex backgrounds through a set of network design strategies. First, unnecessary confusion in the supervision data is avoided by disentangling the textual and background information in the source images. Second, to capture the structure of complicated components, the text skeleton is provided as auxiliary information and texture continuity is considered explicitly in the loss function. Third, the forgery traces induced by the text editing operation are mitigated by post-processing operations that account for the distortions from the print-and-scan channel. Quantitative comparisons between the proposed method and the existing approach show the advantages of our design: the reconstruction error measured in MSE is reduced by about 2/3, and the reconstruction quality measured in PSNR and SSIM is improved by 4 dB and 0.21, respectively. Qualitative experiments confirm that the reconstruction results of the proposed method are visually better than those of the existing approach. More importantly, we demonstrate the performance of the proposed document forgery algorithm in a practical scenario where an attacker is able to alter the textual information in an identity document using only one sample in the target domain. The forged-and-recaptured samples created by the proposed text editing attack and recapturing operation have successfully fooled some existing document authentication systems.
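
The abstract reports reconstruction quality in MSE, PSNR, and SSIM. As a rough illustration of how the first two relate, the following minimal sketch computes MSE and PSNR for flat pixel sequences. It uses the standard textbook definitions and is not taken from the paper's code; the function names, the `peak` parameter, and the assumption that pixels are scaled to [0, 1] are all hypothetical choices made for this example. SSIM is omitted because it needs a windowed implementation.

```python
import math


def mse(reference, estimate):
    """Mean squared error between two equally sized flat pixel sequences."""
    if len(reference) != len(estimate) or not reference:
        raise ValueError("inputs must be non-empty and the same length")
    return sum((r - e) ** 2 for r, e in zip(reference, estimate)) / len(reference)


def psnr(reference, estimate, peak=1.0):
    """Peak signal-to-noise ratio in dB.

    `peak` is the maximum possible pixel value (assumed 1.0 here,
    i.e. pixels normalized to [0, 1]).
    """
    error = mse(reference, estimate)
    if error == 0:
        return math.inf  # identical images
    return 10.0 * math.log10(peak ** 2 / error)


def psnr_gain_from_mse_ratio(ratio):
    """PSNR change in dB when the MSE of a single image is scaled by `ratio`."""
    return -10.0 * math.log10(ratio)


if __name__ == "__main__":
    target = [0.0, 0.0, 1.0, 1.0]
    output = [0.0, 1.0, 1.0, 1.0]
    print(mse(target, output))
    print(psnr(target, output))
    print(psnr_gain_from_mse_ratio(1.0 / 3.0))
```

For a single image, cutting the MSE to one third of its value corresponds to a PSNR gain of 10·log10(3), a little under 5 dB. Metrics averaged over a test set need not follow this relation exactly, since the mean of per-image PSNR values is not the PSNR of the mean MSE.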

Self-supervised Deep Reconstruction of Mixed Strip-shredded Text Documents

Fast(er) Reconstruction of Shredded Text Documents via Self-Supervised Deep Asymmetric Metric Learning

A semi-automatic deshredding method based on curve matching

Research on the Problem of Shredded Document Reconstruction Based on Matching of Pixels for Boundary

SelfRecon: Self Reconstruction Your Digital Avatar from Monocular Video

Task-driven single-image super-resolution reconstruction of document scans

DeepErase: Weakly Supervised Ink Artifact Removal in Document Text Images

Data Reconstruction Based on Supervised Deep Auto-Encoder

Intrinsic Decomposition of Document Images In-the-Wild

Decomposer: Semi-supervised Learning of Image Restoration and Image Decomposition

Self-Supervised Text Erasing with Controllable Image Synthesis

Self-supervised Scene Text Segmentation with Object-centric Layered Representations Augmented by Text Regions

TextRecon for Blind Polluted Text Image Reconstruction

Transformer-Based UNet with Multi-Headed Cross-Attention Skip Connections to Eliminate Artifacts in Scanned Documents

Deep Learning-based Forgery Attack on Document Images

Text Line Segmentation from Struck-out Handwritten Document Images

Attention-based Feature Decomposition-Reconstruction Network for Scene Text Detection

Rethinking Supervision in Document Unwarping: A Self-consistent Flow-free Approach

Unsupervised Document Summarization from Data Reconstruction Perspective

Self-Supervised Memory Learning for Scene Text Image Super-Resolution

DORec: Decomposed Object Reconstruction and Segmentation Utilizing 2D Self-Supervised Features