Abstract:The widespread use of face retouching filters on short-video platforms has raised concerns about the authenticity of digital appearances and the impact of deceptive advertising. To address these issues, there is a pressing need to develop advanced face retouching techniques. However, the lack of large-scale and fine-grained face retouching datasets has been a major obstacle to progress in this field. In this paper, we introduce RetouchingFFHQ, a large-scale and fine-grained face retouching dataset that contains over half a million conditionally-retouched images. RetouchingFFHQ stands out from previous datasets due to its large scale, high quality, fine-grainedness, and customization. By including four typical types of face retouching operations and different retouching levels, we extend the binary face retouching detection into a fine-grained, multi-retouching type, and multi-retouching level estimation problem. Additionally, we propose a Multi-granularity Attention Module (MAM) as a plugin for CNN backbones for enhanced cross-scale representation learning. Extensive experiments using different baselines as well as our proposed method on RetouchingFFHQ show decent performance on face retouching detection. With the proposed new dataset, we believe there is great potential for future work to tackle the challenging problem of real-world fine-grained face retouching detection.

What problem does this paper attempt to address?

### Problems the Paper Attempts to Solve The paper aims to address the authenticity issues brought about by the widespread use of facial retouching filters on social media platforms and their impact on deceptive advertising. Specifically, the paper attempts to solve the following main problems: 1. **Insufficient Dataset Size**: Existing facial retouching datasets are relatively small in scale. The largest dataset, FFHQR, contains about 70,000 retouched images, while other datasets contain fewer than 10,000 samples. These datasets may be insufficient for training robust detection networks. 2. **Lack of Fine-Grained Annotations**: Current datasets lack fine-grained annotations for the types and degrees of facial retouching. For example, some datasets (like VWU) only include facial images of Caucasian women and have limited retouching types. Other datasets (like ND-IIITD, Rathgeb, etc.) only use binary classification methods, ignoring different types and degrees of retouching. 3. **Need for Multi-Label Classification**: The paper proposes the need to construct a more fine-grained facial retouching dataset to train classifiers capable of capturing various types and degrees of retouching. To overcome the limitations of existing datasets, the authors have constructed a large-scale fine-grained facial retouching dataset—RetouchingFFHQ. This dataset contains over 500,000 retouched facial images and provides four typical retouching operations (eye enlargement, face lifting, skin smoothing, and face whitening) as well as four levels of retouching (no retouching, slight, moderate, heavy). Additionally, the authors propose a Multi-Granularity Attention Module (MAM) to enhance cross-scale representation learning, thereby improving facial retouching detection performance.

RetouchingFFHQ: A Large-scale Dataset for Fine-grained Face Retouching Detection

HQRetouch: Learning Professional Face Retouching Via Masked Feature Fusion and Semantic-Aware Modulation

High-fidelity 3D Face Reconstruction with Multi-Scale Details

Face2Face: Label-driven Facial Retouching Restoration

Refining Localized Attention Features with Multi-Scale Relationships for Enhanced Deepfake Detection in Spatial-Frequency Domain

Toward High Quality Facial Representation Learning

GenFace: A Large-Scale Fine-Grained Face Forgery Benchmark and Cross Appearance-Edge Learning

Demography-based facial retouching detection using subclass supervised sparse autoencoder

A Survey of Deep Face Restoration: Denoise, Super-Resolution, Deblur, Artifact Removal

Overcoming False Illusions in Real-World Face Restoration with Multi-Modal Guided Diffusion Model

Pixel-Face: A Large-Scale, High-Resolution Benchmark for 3D Face Reconstruction

Face Forensics in the Wild

Real-time portrait image retouching extended from DualBLN

A Dataset and Benchmark for Large-Scale Multi-Modal Face Anti-Spoofing

HQ-50K: A Large-scale, High-quality Dataset for Image Restoration

Fine-grained Identity Preserving Landmark Synthesis for Face Reenactment

Diverse Dataset for Eyeglasses Detection: Extending the Flickr-Faces-HQ (FFHQ) Dataset

Automatic Facial Skin Feature Detection for Everyone

Survey on Deep Face Restoration: From Non-blind to Blind and Beyond

CelebV-HQ: A Large-Scale Video Facial Attributes Dataset