Abstract: As a crucial application in privacy protection, scene text removal (STR) has received considerable attention in recent years. However, existing approaches erase text from images coarsely and ignore two important properties: background texture integrity (BI) and text erasure exhaustivity (EE). These two properties directly determine erasure performance, and maintaining both in a single network is the core problem of the STR task. In this paper, we attribute the lack of BI and EE to implicit erasure guidance and imbalanced multi-stage erasure, respectively. To improve these two properties, we propose a new ProgrEssively Region-based scene Text eraser (PERT). Our study makes three key contributions. First, we propose a novel explicit erasure guidance to enhance the BI property. Unlike implicit erasure guidance, which modifies all the pixels in the entire image, our explicit guidance accurately performs stroke-level modification using only bounding-box-level annotations. Second, we construct a new balanced multi-stage erasure to improve the EE property. By balancing the learning difficulty and network structure among progressive stages, each stage takes an equal step towards the text-erased image, ensuring erasure exhaustivity. Third, we propose two new evaluation metrics, BI-metric and EE-metric, which make up for the shortcomings of current evaluation tools in analyzing the BI and EE properties. PERT outperforms previous methods by a large margin in both BI-metric ( %) and EE-metric ( %), obtaining SOTA results with high speed (71 FPS) and at least 25% lower parameter complexity. Code will be available at https://github.com/wangyuxin87/PERT.
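
The contrast between implicit and explicit erasure guidance can be illustrated with a small sketch. This is not the PERT implementation: it assumes a hand-rasterized bounding-box mask in place of the region the network would learn, and a toy `eraser` callable in place of the erasing network. The names `box_mask`, `composite`, and `progressive_erase` are hypothetical. It shows only the general idea that pixels outside the text region are copied from the input unchanged, and that erasure proceeds over several stages.

```python
from typing import Callable, List, Tuple

Image = List[List[float]]  # grayscale image as rows of pixel intensities
Mask = List[List[float]]   # 1.0 inside a text region, 0.0 elsewhere


def box_mask(height: int, width: int, box: Tuple[int, int, int, int]) -> Mask:
    """Rasterize one box (top, left, bottom, right; exclusive ends) into a mask.

    Assumption: a plain box stands in for the finer region a network would predict.
    """
    top, left, bottom, right = box
    return [[1.0 if top <= r < bottom and left <= c < right else 0.0
             for c in range(width)]
            for r in range(height)]


def composite(image: Image, erased: Image, mask: Mask) -> Image:
    """Take erased pixels inside the mask and keep original pixels outside it."""
    return [[m * e + (1.0 - m) * p for p, e, m in zip(p_row, e_row, m_row)]
            for p_row, e_row, m_row in zip(image, erased, mask)]


def progressive_erase(image: Image, mask: Mask,
                      eraser: Callable[[Image], Image], stages: int) -> Image:
    """Apply the same hypothetical eraser for several stages, compositing each time."""
    current = image
    for _ in range(stages):
        current = composite(current, eraser(current), mask)
    return current


def toy_eraser(image: Image) -> Image:
    """Toy stand-in for an erasing network: halve every pixel intensity."""
    return [[0.5 * p for p in row] for row in image]


image = [[1.0, 1.0, 1.0], [1.0, 1.0, 1.0]]
mask = box_mask(2, 3, (0, 0, 1, 2))
result = progressive_erase(image, mask, toy_eraser, stages=3)
```

Although the toy eraser changes every pixel it sees, the compositing step discards its output outside the mask, so the background stays bit-identical to the input. Only the masked pixels move towards the erased value from one stage to the next.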

Self-Supervised Text Erasing with Controllable Image Synthesis

Progressive Scene Text Erasing with Self-Supervision

Stroke-Based Scene Text Erasing Using Synthetic Data for Training

MagicEraser: Erasing Any Objects via Semantics-Aware Control

TeSTNeRF: Text-Driven 3D Style Transfer Via Cross-Modal Learning

DeepEraser: Deep Iterative Context Mining for Generic Text Eraser

Modeling Stroke Mask for End-to-End Text Erasing

Editing Text in the Wild

What is the Real Need for Scene Text Removal? Exploring the Background Integrity and Erasure Exhaustivity Properties

Scene text removal via cascaded text stroke detection and erasing

PERT: A Progressively Region-based Network for Scene Text Removal

Exploring Stroke-Level Modifications for Scene Text Editing

A Simple and Strong Baseline: Progressively Region-based Scene Text Removal Networks

RewriteNet: Reliable Scene Text Editing with Implicit Decomposition of Text Contents and Styles

MTRNet: A Generic Scene Text Eraser

Scene Style Text Editing

Scene Text Eraser

Reliable and Efficient Concept Erasure of Text-to-Image Diffusion Models

Explicitly-Decoupled Text Transfer With Minimized Background Reconstruction for Scene Text Editing

TextDestroyer: A Training- and Annotation-Free Diffusion Method for Destroying Anomal Text from Images