Abstract:Person-agnostic face swapping has gained significant attention in recent years, as it offers the potential to enhance various real-world applications by combining high fidelity and identity consistency. However, conventional face swapping methods often rely on intricate adjustments of different loss functions, leading to instability during both the training and inference stages. In this work, we propose a simple yet effective framework named StableSwap with a reversible autoencoder to modify the face in a shared latent space. Our approach capitalizes on the information-rich image latent codes to tackle the challenges of complex editing tasks, utilizing the abundant details present in both the source and target faces. To ensure an expressive and robust latent space, we employ a latent alignment approach with perceptual and adversarial losses to optimize the autoencoder. Additionally, we devise a multi-stage identity injection module that samples multiple features with different facial priors and incorporates them to guide the latent image manipulation. By leveraging attention-based blocks, we fuse these futures and update the latent code in a mask-conditioned manner. Both quantitative and qualitative results on the mainstream benchmarks demonstrate that our StableSwap generates competitive identity-consistent swapped faces compared with state-of-the-art methods. Our method outperforms previous approaches in terms of ID Retrieval (98.68) and FID (2.49), while also exhibiting enhanced stability during model training. Beyond this, our model achieves region-controllable face swapping with the capability to perform more fine-grained operations in latent space.

Temporal Optimization for Face Swapping Video Based on Consistency Inheritance

FaceSwapNet: Landmark Guided Many-to-Many Face Reenactment

Region-Aware Face Swapping

Designing One Unified Framework for High-Fidelity Face Reenactment and Swapping

SwapTalk: Audio-Driven Talking Face Generation with One-Shot Customization in Latent Space

Face Swapping Consistency Transfer with Neural Identity Carrier.

MobileFaceSwap: A Lightweight Framework for Video Face Swapping

Attribute-Aware Head Swapping Guided by 3d Modeling

FaceShifter: Towards High Fidelity And Occlusion Aware Face Swapping

HiFiVFS: High Fidelity Video Face Swapping

StableSwap: Stable Face Swapping in a Shared and Controllable Latent Space

UniFaceGAN: A Unified Framework for Temporally Consistent Facial Video Editing

DiffSwap: High-Fidelity and Controllable Face Swapping Via 3D-Aware Masked Diffusion

Identity-Preserving Face Swapping via Dual Surrogate Generative Models

ReliableSwap: Boosting General Face Swapping Via Reliable Supervision

An Efficient Attribute-Preserving Framework for Face Swapping

Realistic and Efficient Face Swapping: A Unified Approach with Diffusion Models

Task-agnostic Temporally Consistent Facial Video Editing

AGIL-SwinT: Attention-guided Inconsistency Learning for Face Forgery Detection

SimSwap++: Towards Faster and High-Quality Identity Swapping

High-Fidelity Face Swapping with Style Blending