IPGAN: Identity-Preservation Generative Adversarial Network for unsupervised photo-to-caricature translation

Lan Yan,Wenbo Zheng,Chao Gou,Fei-Yue Wang
DOI: https://doi.org/10.1016/j.knosys.2022.108223
2022-04-01
Abstract:Photo-to-caricature translation is an extremely challenging task because there are not only texture differences between caricatures and photos, but also various spatial deformations in caricatures. Most of existing methods tend to introduce difficult obtained additional information such as precise facial landmarks to guide caricature generation. In addition, identity preservation is a crucial characteristic of caricatures, but unfortunately there seems to be few methods to consider it. Motivated by the aforementioned observations, we propose an Identity-Preservation Generative Adversarial Network (IPGAN) for unsupervised photo-to-caricature translation. In particular, considering the importance of identity retention, we propose a novel identity preservation loss to hold the identity information of original photos and improve the quality of generated caricatures. To capture realistic caricature styles, we design a style differentiation loss to help our model produce caricatures with styles that remarkably differ from photos. Moreover, to learn satisfactory deformations without supervision, our model uses a warp controller to acquire exaggerations automatically that enable to customize diverse exaggerations. As an unsupervised translation method, our IPGAN can also be applied to caricature-to-photo translation. Experiments on the WebCaricature dataset suggest that our IPGAN achieves state-of-the-art performance and can generate realistic as well as identity preservation caricatures.
computer science, artificial intelligence
What problem does this paper attempt to address?