Finding AI-Generated Faces in the Wild

Gonzalo J. Aniano Porcile,Jack Gindi,Shivansh Mundra,James R. Verbus,Hany Farid
2024-04-06
Abstract:AI-based image generation has continued to rapidly improve, producing increasingly more realistic images with fewer obvious visual flaws. AI-generated images are being used to create fake online profiles which in turn are being used for spam, fraud, and disinformation campaigns. As the general problem of detecting any type of manipulated or synthesized content is receiving increasing attention, here we focus on a more narrow task of distinguishing a real face from an AI-generated face. This is particularly applicable when tackling inauthentic online accounts with a fake user profile photo. We show that by focusing on only faces, a more resilient and general-purpose artifact can be detected that allows for the detection of AI-generated faces from a variety of GAN- and diffusion-based synthesis engines, and across image resolutions (as low as 128 x 128 pixels) and qualities.
Computer Vision and Pattern Recognition,Artificial Intelligence
What problem does this paper attempt to address?
### The Problem the Paper Attempts to Solve The paper primarily focuses on how to distinguish between real human faces and AI-generated face images on large-scale network platforms. Specifically, researchers have found that with the advancement of AI-based image generation technologies (such as Generative Adversarial Networks (GANs) and diffusion models), the generated face images are becoming increasingly realistic and are being used to create fake online profiles, thereby facilitating spam, fraud, and misinformation activities. Therefore, the core issue of the paper is: **How to develop a fast and reliable technique to distinguish between real human face images and AI-generated face images?** To achieve this goal, the researchers adopted the following strategies: - **Focus on Face Images**: By concentrating on faces rather than other types of synthetic images, the researchers discovered a semantic-level feature that can effectively distinguish face images generated by various GANs and diffusion models. - **Extensive Training Dataset**: A large number of images from different generation engines (including various versions of StyleGAN, Stable Diffusion, etc.) were used for model training to ensure that the model can generalize to unseen generation engines. - **Robustness Testing**: The model's performance was tested under different resolutions, image qualities, and JPEG compression conditions to verify its robustness against various processing methods. Through the above methods, the research team proposed an efficient method to detect fake user avatars on social media that may be used for fraudulent activities.