Abstract:With the ongoing popularization of online services, the digital document images have been used in various applications. Meanwhile, there have emerged some deep learning-based text editing algorithms which alter the textual information of an image . In this work, we present a document forgery algorithm to edit practical document images. To achieve this goal, the limitations of existing text editing algorithms towards complicated characters and complex background are addressed by a set of network design strategies. First, the unnecessary confusion in the supervision data is avoided by disentangling the textual and background information in the source images. Second, to capture the structure of some complicated components, the text skeleton is provided as auxiliary information and the continuity in texture is considered explicitly in the loss function. Third, the forgery traces induced by the text editing operation are mitigated by some post-processing operations which consider the distortions from the print-and-scan channel. Quantitative comparisons of the proposed method and the exiting approach have shown the advantages of our design by reducing the about 2/3 reconstruction error measured in MSE, improving reconstruction quality measured in PSNR and in SSIM by 4 dB and 0.21, respectively. Qualitative experiments have confirmed that the reconstruction results of the proposed method are visually better than the existing approach. More importantly, we have demonstrated the performance of the proposed document forgery algorithm under a practical scenario where an attacker is able to alter the textual information in an identity document using only one sample in the target domain. The forged-and-recaptured samples created by the proposed text editing attack and recapturing operation have successfully fooled some existing document authentication systems.

An Evaluation of OCR Systems Against Adversarial Machine Learning

Fooling OCR Systems with Adversarial Text Images

Robust CAPTCHAs Towards Malicious OCR

A Black-Box Attack on Optical Character Recognition Systems

Attacking Optical Character Recognition (OCR) Systems with Adversarial Watermarks

ProTegO: Protect Text Content against OCR Extraction Attack

Robust Text CAPTCHAs Using Adversarial Examples

Detection Masking for Improved OCR on Noisy Documents

Humanizing Machine-Generated Content: Evading AI-Text Detection through Adversarial Attack

Efficient, Lexicon-Free OCR using Deep Learning

Bypassing Captcha By Machine A Proof For Passing The Turing Test

Advancing machine learning with OCR2SEQ: an innovative approach to multi-modal data augmentation

Captcha Attack: Turning Captchas Against Humanity

Universal Defensive Underpainting Patch: Making Your Text Invisible to Optical Character Recognition

Seeing Through the Mask: Rethinking Adversarial Examples for CAPTCHAs

blessing in disguise: Designing Robust Turing Test by Employing Algorithm Unrobustness

Simple Transparent Adversarial Examples

Advanced Digital Image Processing Technique based Optical Character Recognition of Scanned Document

Undermining Image and Text Classification Algorithms Using Adversarial Attacks

Deep Learning-based Forgery Attack on Document Images

Rerunning OCR: A Machine Learning Approach to Quality Assessment and Enhancement Prediction