Temporal Optimization for Face Swapping Video Based on Consistency Inheritance

Zhijian Deng, Wenbo Zhou, Kunlin Liu, Weiming Zhang, Nenghai Yu
DOI: https://doi.org/10.1145/3674399.3674457
2024-01-01
Abstract: Applying existing face swapping algorithms independently to each video frame typically leads to temporal inconsistency. We analyze the inconsistency in the generated results and model inter-frame inconsistency as time-domain noise. We propose a face swapping mapper network that inherits identity and suppresses this noise. The training strategy combines three losses: a primary perceptual loss to learn the face swapping information of the reference face, an optical flow loss to impose temporal constraints, and an identity loss to transfer identity information. In addition, we introduce a 3D face disentanglement model that regresses FLAME parameters and precisely guides the optimization direction for facial detail consistency. Training uses only a single pair of original and swapped videos, eliminating the need for a large dataset. Experiments demonstrate that our method improves the temporal consistency and detail consistency of the results and enhances the video-level generation quality of face swapping methods.
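
To make the role of the optical flow loss concrete, the following is a minimal sketch of a flow-based temporal consistency term and a weighted sum of the three losses named in the abstract. It is not the authors' implementation: the abstract does not give the loss formulas, so the nearest-neighbor warping, the L1 distance, the function names (`warp_nearest`, `temporal_loss`, `total_loss`), and the weights are all assumptions made for illustration.

```python
import numpy as np

def warp_nearest(frame, flow):
    """Backward-warp `frame` with a per-pixel flow field (assumed layout: H x W x 2, as (dx, dy)).

    Output pixel (y, x) samples `frame` at (y + dy, x + dx), rounded to the
    nearest pixel and clamped to the image border.
    """
    h, w = frame.shape[:2]
    ys, xs = np.mgrid[0:h, 0:w]
    src_x = np.clip(np.rint(xs + flow[..., 0]).astype(int), 0, w - 1)
    src_y = np.clip(np.rint(ys + flow[..., 1]).astype(int), 0, h - 1)
    return frame[src_y, src_x]

def temporal_loss(prev_out, cur_out, flow):
    """Mean absolute difference between the current output and the
    previous output warped into the current frame (hypothetical L1 form)."""
    warped = warp_nearest(prev_out, flow)
    return float(np.mean(np.abs(cur_out - warped)))

def total_loss(l_perc, l_flow, l_id, w_perc=1.0, w_flow=1.0, w_id=1.0):
    """Weighted sum of perceptual, optical flow, and identity losses.
    The weights are placeholders, not values from the paper."""
    return w_perc * l_perc + w_flow * l_flow + w_id * l_id

# Toy example: a single bright pixel moves one step to the right.
prev_out = np.zeros((5, 5)); prev_out[2, 2] = 1.0
cur_out = np.zeros((5, 5)); cur_out[2, 3] = 1.0

# Flow from the current frame back to the previous one: look one pixel to the left.
flow = np.zeros((5, 5, 2)); flow[..., 0] = -1.0

consistent = temporal_loss(prev_out, cur_out, flow)               # motion explained by flow
unexplained = temporal_loss(prev_out, cur_out, np.zeros((5, 5, 2)))  # motion ignored
```

In this toy case the loss vanishes when the flow accounts for the motion and is positive when it does not, which is the property a temporal constraint of this kind relies on: differences between consecutive outputs that the estimated motion cannot explain are treated as noise and penalized. A practical version would likely use bilinear sampling and an occlusion mask, neither of which is shown here.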