Artifact Feature Purification for Cross-domain Detection of AI-generated Images

Zheling Meng,Bo Peng,Jing Dong,Tieniu Tan
2024-03-17
Abstract:In the era of AIGC, the fast development of visual content generation technologies, such as diffusion models, bring potential security risks to our society. Existing generated image detection methods suffer from performance drop when faced with out-of-domain generators and image scenes. To relieve this problem, we propose Artifact Purification Network (APN) to facilitate the artifact extraction from generated images through the explicit and implicit purification processes. For the explicit one, a suspicious frequency-band proposal method and a spatial feature decomposition method are proposed to extract artifact-related features. For the implicit one, a training strategy based on mutual information estimation is proposed to further purify the artifact-related features. Experiments show that for cross-generator detection, the average accuracy of APN is 5.6% ~ 16.4% higher than the previous 10 methods on GenImage dataset and 1.7% ~ 50.1% on DiffusionForensics dataset. For cross-scene detection, APN maintains its high performance. Via visualization analysis, we find that the proposed method extracts flexible forgery patterns and condenses the forgery information diluted in irrelevant features. We also find that the artifact features APN focuses on across generators and scenes are global and diverse. The code will be available on GitHub.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The paper attempts to address the issue of performance degradation in AI-generated image detection in cross-domain scenarios (different generators and different image scenes). Specifically, existing methods for detecting generated images experience significant performance drops when dealing with images from different generators or different scenes. To solve this problem, the authors propose the "Artifact Purification Network" (APN), which extracts artifact features through explicit and implicit purification processes. Explicit purification includes methods for separating artifact features from the frequency spectrum and spatial features; implicit purification further purifies these artifact features based on a mutual information estimation strategy, thereby improving cross-domain detection performance. Experimental results show that APN outperforms 10 existing methods in both cross-generator and cross-scene detection, with average precision improvements of 5.6%~16.4% and 1.7%~50.1% on two datasets, respectively.