RetouchingFFHQ: A Large-scale Dataset for Fine-grained Face Retouching Detection

Qichao Ying, Jiaxin Liu, Sheng Li, Haisheng Xu, Zhenxing Qian, Xinpeng Zhang
2023-07-20
Abstract: The widespread use of face retouching filters on short-video platforms has raised concerns about the authenticity of digital appearances and the impact of deceptive advertising. To address these issues, there is a pressing need to develop advanced face retouching techniques. However, the lack of large-scale and fine-grained face retouching datasets has been a major obstacle to progress in this field. In this paper, we introduce RetouchingFFHQ, a large-scale and fine-grained face retouching dataset that contains over half a million conditionally-retouched images. RetouchingFFHQ stands out from previous datasets due to its large scale, high quality, fine-grainedness, and customization. By including four typical types of face retouching operations and different retouching levels, we extend the binary face retouching detection into a fine-grained, multi-retouching type, and multi-retouching level estimation problem. Additionally, we propose a Multi-granularity Attention Module (MAM) as a plugin for CNN backbones for enhanced cross-scale representation learning. Extensive experiments using different baselines as well as our proposed method on RetouchingFFHQ show decent performance on face retouching detection. With the proposed new dataset, we believe there is great potential for future work to tackle the challenging problem of real-world fine-grained face retouching detection.
Computer Vision and Pattern Recognition, Multimedia
What problem does this paper attempt to address?
### Problems the Paper Attempts to Solve

The paper addresses the authenticity concerns raised by the widespread use of facial retouching filters on social media platforms, including their role in deceptive advertising. Specifically, it targets the following problems:

1. **Insufficient Dataset Size**: Existing facial retouching datasets are relatively small. The largest, FFHQR, contains about 70,000 retouched images, while the other datasets contain fewer than 10,000 samples each. These datasets may be insufficient for training robust detection networks.
2. **Lack of Fine-Grained Annotations**: Current datasets lack fine-grained annotations for the types and degrees of facial retouching. For example, some datasets (such as VWU) include only facial images of Caucasian women and cover a limited set of retouching types. Others (such as ND-IIITD and Rathgeb) provide only binary labels, ignoring the different types and degrees of retouching.
3. **Need for Multi-Label Classification**: The paper argues that a more fine-grained facial retouching dataset is needed to train classifiers that can capture the various types and degrees of retouching.

To overcome the limitations of existing datasets, the authors construct RetouchingFFHQ, a large-scale, fine-grained facial retouching dataset. It contains over 500,000 retouched facial images and covers four typical retouching operations (eye enlargement, face lifting, skin smoothing, and face whitening) as well as four levels of retouching (no retouching, slight, moderate, heavy). Additionally, the authors propose a Multi-Granularity Attention Module (MAM) to enhance cross-scale representation learning, thereby improving facial retouching detection performance.
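
The label structure described above (four retouching operations, each at one of four levels) can be illustrated with a short Python sketch. The operation and level names come from the summary. The identifiers, the integer encoding, and the helper functions `encode` and `is_retouched` are assumptions made for illustration and are not the dataset's actual annotation format.

```python
from itertools import product

# Operation and level names follow the summary above; the identifiers and
# the integer encoding are hypothetical, not the dataset's real schema.
OPERATIONS = ("eye_enlargement", "face_lifting", "skin_smoothing", "face_whitening")
LEVELS = ("none", "slight", "moderate", "heavy")


def encode(label: dict) -> list:
    """Map {operation: level name} to one level index per operation.

    Operations missing from the dict are treated as "none" (index 0).
    """
    return [LEVELS.index(label.get(op, "none")) for op in OPERATIONS]


def is_retouched(vector: list) -> bool:
    """Collapse the fine-grained label to the older binary task:
    an image counts as retouched if any operation has a non-zero level."""
    return any(level > 0 for level in vector)


# Every combination of one level per operation.
ALL_LABELS = list(product(range(len(LEVELS)), repeat=len(OPERATIONS)))

if __name__ == "__main__":
    example = encode({"eye_enlargement": "slight", "skin_smoothing": "heavy"})
    print(example, is_retouched(example), len(ALL_LABELS))
```

Under this encoding, a binary detector only needs the output of `is_retouched`, while a fine-grained classifier predicts one level per operation. With four levels for each of four operations, the label space has at most 4^4 = 256 combinations. Whether the dataset includes all of them is not stated in the summary.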