CoDeGAN: Contrastive Disentanglement for Generative Adversarial Network

Jiangwei Zhao, Zejia Liu, Xiaohan Guo, Lili Pan

2024-05-31

Abstract: Disentanglement, a critical concern in interpretable machine learning, has also garnered significant attention from the computer vision community. Many existing GAN-based (unsupervised) class disentanglement approaches, such as InfoGAN and its variants, primarily aim to maximize the mutual information (MI) between the generated image and its latent codes. However, this focus may lead the network to generate highly similar images when presented with the same latent class factor, potentially resulting in mode collapse or mode dropping. To alleviate this problem, we propose CoDeGAN (Contrastive Disentanglement for Generative Adversarial Networks), where we relax similarity constraints for disentanglement from the image domain to the feature domain. This modification not only enhances the stability of GAN training but also improves its disentangling capabilities. Moreover, we integrate self-supervised pre-training into CoDeGAN to learn semantic representations, significantly facilitating unsupervised disentanglement. Extensive experimental results demonstrate the superiority of our method over state-of-the-art approaches across multiple benchmarks. The code is available at https://github.com/learninginvision/CoDeGAN.
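
For reference, the MI objective that the abstract attributes to InfoGAN is usually written as a variational lower bound. The notation below follows the InfoGAN paper and does not appear in this text, so it is an assumption here: c is the latent code, z the noise vector, Q an auxiliary network approximating the posterior over codes, V(D, G) the standard GAN value function, and λ a weighting hyperparameter.

```latex
% Variational lower bound on the mutual information (InfoGAN notation, assumed)
I\bigl(c;\, G(z, c)\bigr) \;\ge\; L_I(G, Q)
  = \mathbb{E}_{c \sim P(c),\; x \sim G(z, c)}\bigl[\log Q(c \mid x)\bigr] + H(c)

% InfoGAN objective: the GAN game regularized by the MI lower bound
\min_{G,\,Q} \max_{D} \; V_{\mathrm{InfoGAN}}(D, G, Q) = V(D, G) - \lambda\, L_I(G, Q)
```

Because Q must recover c from the image x itself, images that share a code are pushed to look alike, which is the similarity pressure the abstract describes.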

Computer Vision and Pattern Recognition, Artificial Intelligence, Machine Learning

What problem does this paper attempt to address?

This paper aims to improve representation disentanglement in Generative Adversarial Networks (GANs), particularly the disentanglement of discrete factors. Existing GAN-based disentanglement methods, such as InfoGAN and its variants, mainly work by maximizing the mutual information (MI) between the generated images and their latent codes. However, this objective can make the images generated from the same latent class factor too similar, which can lead to mode collapse or mode dropping. To address this, the authors propose Contrastive Disentanglement for Generative Adversarial Networks (CoDeGAN). Its main contributions are:

1. **Relaxing the similarity constraint**: The similarity constraint for disentanglement is moved from the image domain to the feature domain, which both stabilizes GAN training and improves disentanglement.
2. **Self-supervised pre-training**: Self-supervised pre-training is integrated into CoDeGAN to learn semantic representations, which significantly helps unsupervised disentanglement.
3. **Performance improvement**: Experimental results show that CoDeGAN outperforms existing methods on multiple benchmark datasets. On CIFAR-10 in particular, it achieves absolute improvements of 19% and 16% over InfoGAN and the previous state-of-the-art method, respectively.

Through these changes, CoDeGAN aims to improve the quality of generated images and the accuracy of disentanglement while reducing the risk of mode collapse and mode dropping.
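
The idea of enforcing similarity in the feature domain rather than the image domain can be illustrated with a generic contrastive loss. The sketch below is not the loss from the paper, whose exact form is not given in this text. It assumes a supervised-contrastive-style objective in which features of images generated from the same discrete code are treated as positives. The function name `class_contrastive_loss`, its parameters, and the temperature value are all hypothetical.

```python
import numpy as np


def class_contrastive_loss(features, codes, temperature=0.1):
    """Generic contrastive loss (an assumption, not the paper's exact loss).

    features: (N, D) array of encoder features of generated images, N >= 2.
    codes:    (N,) integer array with the discrete latent code of each image.
    Samples that share a code are positives; all other samples are negatives.
    """
    features = np.asarray(features, dtype=float)
    codes = np.asarray(codes)
    # L2-normalize so that dot products are cosine similarities.
    z = features / np.linalg.norm(features, axis=1, keepdims=True)
    sim = z @ z.T / temperature
    n = len(codes)
    not_self = ~np.eye(n, dtype=bool)
    # Exclude each sample's similarity with itself from the softmax.
    sim = np.where(not_self, sim, -np.inf)
    # Numerically stable log-softmax over the other samples in the batch.
    sim_max = sim.max(axis=1, keepdims=True)
    shifted = sim - sim_max
    log_prob = shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))
    # Positives: same code, different sample.
    positives = (codes[:, None] == codes[None, :]) & not_self
    pos_count = positives.sum(axis=1)
    valid = pos_count > 0  # skip anchors that have no positive in the batch
    pos_log_prob = np.where(positives, log_prob, 0.0).sum(axis=1)
    return -(pos_log_prob[valid] / pos_count[valid]).mean()


if __name__ == "__main__":
    # Toy features: two tight clusters in a 2-D feature space.
    feats = np.array([[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]])
    aligned = np.array([0, 0, 1, 1])     # codes agree with the clusters
    misaligned = np.array([0, 1, 0, 1])  # codes cut across the clusters
    print("aligned codes:   ", class_contrastive_loss(feats, aligned))
    print("misaligned codes:", class_contrastive_loss(feats, misaligned))
```

The loss only asks that same-code samples be close in feature space, so two images can differ at the pixel level and still count as a matching pair. That is the sense in which the constraint is relaxed compared with recovering the code directly from the image.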

DiscoGNN: A Sample-Efficient Framework for Self-Supervised Graph Representation Learning

Learning Disentangled Representation by Exploiting Pretrained Generative Models: A Contrastive Learning View

Disentangling Factors of Variation in Deep Representations Using Adversarial Training.

CDE-GAN: Cooperative Dual Evolution Based Generative Adversarial Network

GenCo: Generative Co-training for Generative Adversarial Networks with Limited Data

ComGAN: Unsupervised Disentanglement and Segmentation via Image Composition

Representation Decomposition for Image Manipulation and Beyond

Unpaired Deep Image Dehazing Using Contrastive Disentanglement Learning

DivCo: Diverse Conditional Image Synthesis via Contrastive Generative Adversarial Network

GAN Dissection: Visualizing and Understanding Generative Adversarial Networks

OOGAN: Disentangling GAN with One-Hot Sampling and Orthogonal Regularization

DEGAN: Decompose-Enhance-GAN Network for Simultaneous Low-Light Image Lightening and Denoising

InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets

Inference-InfoGAN: Inference Independence via Embedding Orthogonal Basis Expansion

Generative Semantic Manipulation with Contrasting GAN

Disentangled Inference for GANs With Latently Invertible Autoencoder

Towards the Gradient Vanishing, Divergence Mismatching and Mode Collapse of Generative Adversarial Nets

Dist-GAN: An Improved GAN using Distance Constraints

InfoMax-GAN: Improved Adversarial Image Generation via Information Maximization and Contrastive Learning
