Generating Automatically Print/Scan Textures for Morphing Attack Detection Applications

Juan E. Tapia,Maximilian Russo,Christoph Busch
2024-08-19
Abstract:Morphing Attack Detection (MAD) is a relevant topic that aims to detect attempts by unauthorised individuals to access a "valid" identity. One of the main scenarios is printing morphed images and submitting the respective print in a passport application process. Today, small datasets are available to train the MAD algorithm because of privacy concerns and the limitations resulting from the effort associated with the printing and scanning of images at large numbers. In order to improve the detection capabilities and spot such morphing attacks, it will be necessary to have a larger and more realistic dataset representing the passport application scenario with the diversity of devices and the resulting printed scanned or compressed images. Creating training data representing the diversity of attacks is a very demanding task because the training material is developed manually. This paper proposes two different methods based on transfer-transfer for automatically creating digital print/scan face images and using such images in the training of a Morphing Attack Detection algorithm. Our proposed method can reach an Equal Error Rate (EER) of 3.84% and 1.92% on the FRGC/FERET database when including our synthetic and texture-transfer print/scan with 600 dpi to handcrafted images, respectively.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
### What problem does this paper attempt to solve? This paper aims to solve an important problem in **Morphing Attack Detection (MAD)**, that is, how to effectively detect cases where unauthorized individuals attempt to use the "legal" identities of others. Specifically, the paper focuses on the attack behavior of impersonating others by printing and submitting synthetic facial images during the passport application process. #### Background of the main problem 1. **Dataset limitations**: - Currently, the datasets used to train MAD algorithms are very limited, mainly due to privacy issues and the high cost of a large number of printed and scanned images. - Such small - scale and less realistic datasets cannot fully represent the diversity and complexity in practical application scenarios, resulting in limited detection capabilities of MAD systems. 2. **Limitations of manually generated data**: - The process of manually creating printed/scanned images is very time - consuming and labor - intensive. Every time a new scanning device is replaced, this process needs to be repeated to ensure compliance with technical specifications. - This method is not only inefficient but also difficult to ensure data consistency and quality. #### Solutions proposed in the paper To overcome the above problems, the paper proposes two transfer - learning - based methods for automatically generating digital printed/scanned facial images and applying them to the training of MAD algorithms: 1. **Image generation methods based on Pix2pix and CycleGAN**: - Use the Pix2pix algorithm for paired - image conversion, converting the original digital image into a simulated printed/scanned image. - Use the CycleGAN algorithm for unpaired - image conversion to further improve the diversity and authenticity of the generated images. 2. **Semi - automatic texture transfer method**: - Extract the noise/artifacts generated during the printing and scanning process and directly apply them to real and synthetic facial images to simulate the real printing/scanning effect. #### Experimental results and evaluation - The paper shows the performance of the images generated by these methods on the Frechet Inception Distance (FID), verifying their similarity to real printed/scanned images. - By conducting experiments on the FRGC/FERET databases, the paper proves that these methods can significantly improve the detection performance of MAD systems. In particular, in the single - image MAD (S - MAD) task, the equal error rates (EER) reach 3.84% and 1.92% respectively. ### Summary The main contribution of this paper lies in proposing an efficient and automated method to generate a large number of printed/scanned facial images, thereby enriching the training datasets of MAD systems and improving detection performance. This not only solves the problem of insufficient existing datasets but also provides new ideas and tools for future research.