Abstract:Unpaired image translation with feature-level constraints presents significant challenges, including unstable network training and low diversity in generated tasks. This limitation is typically attributed to the following situations: 1. The generated images are overly simplistic, which fails to stimulate the network's capacity for generating diverse and imaginative outputs. 2. The images produced are distorted, a direct consequence of unstable training conditions. To address this limitation, the unpaired image-to-image translation with diffusion adversarial network (UNDAN) is proposed. Specifically, our model consists of two modules: (1) Feature fusion module: In this module, one-dimensional SVD features are transformed into two-dimensional SVD features using the convolutional two-dimensionalization method, enhancing the diversity of the images generated by the network. (2) Network convergence module: In this module, the generator transitions from the U-net model to a superior diffusion model. This shift leverages the stability of the diffusion model to mitigate the mode collapse issues commonly associated with adversarial network training. In summary, the CycleGAN framework is utilized to achieve unpaired image translation through the application of cycle-consistent loss. Finally, the proposed network was verified from both qualitative and quantitative aspects. The experiments show that the method proposed can generate more realistic converted images.

Unpaired image-to-image translation with improved two-dimensional feature

Multimodal Image-to-Image Translation via Mutual Information Estimation and Maximization

Unpaired Image-to-Image Translation with Diffusion Adversarial Network

Unpaired Salient Object Translation Via Spatial Attention Prior

Unsupervised Image-to-Image Translation with Generative Adversarial Networks.

Trans-Cycle: Unpaired Image-to-Image Translation Network by Transformer

A one-to-many conditional generative adversarial network framework for multiple image-to-image translations

ITTR: Unpaired Image-to-Image Translation with Transformers

Unformer: A Transformer-Based Approach for Adaptive Multi-Scale Feature Aggregation in Underwater Image Enhancement

Towards Instance-level Image-to-Image Translation

UCGAN: a contrastive learning-based unaligned image translation model with importance reweighting

Towards Visual Feature Translation

UVCGAN: UNet Vision Transformer cycle-consistent GAN for unpaired image-to-image translation

Unsupervised Multi-Domain Multimodal Image-to-image Translation with Explicit Domain-Constrained Disentanglement.

Unpaired Image-to-Image Translation Using Adversarial Consistency Loss

Enhanced Unsupervised Image-to-Image Translation Using Contrastive Learning and Histogram of Oriented Gradients

UNIT-DDPM: UNpaired Image Translation with Denoising Diffusion Probabilistic Models

Feature-attention Module for Context-Aware Image-to-image Translation

Contrastive Learning with Attention Mechanism and Multi-Scale Sample Network for Unpaired Image-to-Image Translation

Unbalanced Feature Transport for Exemplar-based Image Translation

General Image-to-Image Translation with One-Shot Image Guidance