Abstract: Face cross-domain translation aims to map face images from one image domain to another. Common face image translation tasks include face photo-sketch and face photo-APDrawing cross-domain translation, which are widely applicable in real-world scenarios such as criminal investigation, movie production, and digital entertainment. However, because paired face images are scarce and the gap in color and texture between domains is large, face image cross-domain translation still faces many challenges. Existing methods often produce blurring, artifacts, and structural distortion, leading to poor visual quality. To tackle this problem, we propose a self-discriminative cycle generative adversarial network, in which each generator adopts an encoder-decoder structure and its corresponding discriminator is the encoder of the other generator, which works in the reverse direction. In this self-discriminative manner, the encoder (i.e., the discriminator) incorporates both "True/False" semantic information and sensitivity to pixel-level information, thereby enhancing the robustness and generalization ability of the generative model. In addition, we propose a novel omni-directional pixel-gradient loss. A specially designed convolution kernel computes the gradients in all directions around each pixel to extract gradient information. By constraining the gradient information of generated images to be consistent with that of ground-truth images, the model is encouraged to learn the continuous inter-pixel variation pattern effectively. The omni-directional pixel-gradient loss can be flexibly applied to other generative models to improve their performance. Extensive experiments show that the proposed framework produces advanced results on the paired face photo-sketch datasets (CUFS, CUFSF) and the photo-APDrawing dataset (APDrawing). We further demonstrate the strong generalization ability of our model on real-world data and its excellent performance on unpaired face datasets.
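The idea behind the omni-directional pixel-gradient loss can be illustrated with a minimal NumPy sketch. The abstract does not specify the convolution kernel, so everything concrete here is an assumption made for illustration: the 8-neighbour difference operators on a 3x3 window, the edge-replicating padding, the mean absolute (L1) comparison, and the names `OFFSETS`, `directional_gradients`, and `pixel_gradient_loss`. It should not be read as the authors' implementation.

```python
import numpy as np

# Offsets (dy, dx) of the 8 neighbours around a pixel.
# Assumption: the abstract does not give the actual kernel design.
OFFSETS = [(-1, -1), (-1, 0), (-1, 1),
           (0, -1),           (0, 1),
           (1, -1),  (1, 0),  (1, 1)]


def directional_gradients(img):
    """Return an (8, H, W) stack of neighbour-minus-centre differences
    for a single-channel image of shape (H, W)."""
    img = np.asarray(img, dtype=np.float64)
    h, w = img.shape
    padded = np.pad(img, 1, mode="edge")  # replicate border pixels
    grads = []
    for dy, dx in OFFSETS:
        # Shifted view of the padded image aligned with the original grid.
        neighbour = padded[1 + dy:1 + dy + h, 1 + dx:1 + dx + w]
        grads.append(neighbour - img)
    return np.stack(grads)


def pixel_gradient_loss(generated, target):
    """Mean absolute difference between the two gradient stacks
    (hypothetical L1 form of the consistency constraint)."""
    diff = directional_gradients(generated) - directional_gradients(target)
    return float(np.mean(np.abs(diff)))
```

Under these assumptions the loss is zero whenever two images differ only by a constant intensity offset, because it compares inter-pixel variation rather than absolute pixel values. In a training setup it would typically be added to the adversarial and reconstruction terms with a weighting factor.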

PROMOTE: Prior-Guided Diffusion Model with Global-Local Contrastive Learning for Exemplar-Based Image Translation

Image Cross-Domain Translation Algorithm Based on Self-Similarity and Contrastive Learning

Unpaired Salient Object Translation Via Spatial Attention Prior

Multimodal Image-to-Image Translation via Mutual Information Estimation and Maximization

EDIT: Exemplar-Domain Aware Image-to-Image Translation

Image-to-Image Translation with Multi-Path Consistency Regularization

Target-Guided Diffusion Models for Unpaired Cross-modality Medical Image Translation

Edge-guided Adversarial Network Based on Contrastive Learning for Image-to-Image Translation

GP-UNIT: Generative Prior for Versatile Unsupervised Image-to-Image Translation

Unsupervised Image-to-Image Translation with Generative Prior

EBDM: Exemplar-guided Image Translation with Brownian-bridge Diffusion Models

MirrorDiffusion: Stabilizing Diffusion Process in Zero-shot Image Translation by Prompts Redescription and Beyond

Palette: Image-to-Image Diffusion Models

Unbalanced Feature Transport for Exemplar-based Image Translation

Design Booster: A Text-Guided Diffusion Model for Image Translation with Spatial Layout Preservation

PI-Trans: Parallel-ConvMLP and Implicit-Transformation Based GAN for Cross-View Image Translation

Unsupervised content and style learning for multimodal cross-domain image translation

Self-discriminative Cycle Generative Adversarial Networks for Face Image Translation

Diffusion-based Image Translation with Label Guidance for Domain Adaptive Semantic Segmentation