Tackling mode collapse in multi-generator GANs with orthogonal vectors

Wei Li,Li Fan,Zhenyu Wang,Chao Ma,Xiaohui Cui

DOI: https://doi.org/10.1016/j.patcog.2020.107646

IF: 8

2021-02-01

Pattern Recognition

Abstract:<p>Generative Adversarial Networks (GANs) have been widely used to generate realistic-looking instances. However, training robust GAN is a non-trivial task due to the problem of mode collapse. Although many GAN variants are proposed to overcome this problem, they have limitations. Those existing studies either generate identical instances or result in negative gradients during training. In this paper, we propose a new approach to training GAN to overcome mode collapse by employing a set of generators, an encoder and a discriminator. A new minimax formula is proposed to simultaneously train all components in a similar spirit to vanilla GAN. The orthogonal vector strategy is employed to guide multiple generators to learn different information in a complementary manner. In this way, we term our approach <em>Multi-Generator Orthogonal GAN</em> (MGO-GAN). Specifically, the synthetic data produced by those generators are fed into the encoder to obtain feature vectors. The orthogonal value is calculated between any two feature vectors, which loyally reflects the correlation between vectors. Such a correlation indicates how different information has been learnt by generators. The lower the orthogonal value is, the more different information the generators learn. We minimize the orthogonal value along with minimizing the generator loss through back-propagation in the training of GAN. The orthogonal value is integrated with the original generator loss to jointly update the corresponding generator's parameters. We conduct extensive experiments utilizing MNIST, CIFAR10 and CelebA datasets to demonstrate the significant performance improvement of MGO-GAN in terms of generated data quality and diversity at different resolutions.</p>

computer science, artificial intelligence,engineering, electrical & electronic

What problem does this paper attempt to address?

The problem that this paper attempts to solve is to explore the relationship between the expressive power of Hybrid Temporal Logic (HTL) on data words and that of Linear - time Temporal Logic LTL↓. Specifically, the researchers focus on the following aspects: 1. **Comparison of expressive power**: The paper shows that there are some properties of data words that can be expressed in HTL using two variables but cannot be expressed in LTL↓. At the same time, the paper also proves that all properties that can be expressed in HTL using only one variable can also be expressed in LTL↓ using only one variable. 2. **Comparison of succinctness**: The paper further studies the succinctness of HTL relative to LTL↓. The results show that the formulas of HTL1 (i.e., HTL using only one variable) can be exponentially more succinct than the formulas of LTL↓, and the formulas of HTL can even express some properties expressible in LTL↓ in a non - elementary more succinct way. 3. **Variable hierarchy**: The paper also explores the variable hierarchy of HTL on data strings and proves that this hierarchy is infinite. This means that as the number of variables increases, the properties that HTL can express also keep increasing and will not stop growing at a certain point. In summary, the main objective of this paper is to clarify the advantages and limitations of HTL in terms of expressive power and succinctness relative to LTL↓ on data words through theoretical analysis and formal proof, thereby providing a theoretical basis for the choice between these two logics in practical applications.

Tackling mode collapse in multi-generator GANs with orthogonal vectors

Soft Generative Adversarial Network: Combating Mode Collapse in Generative Adversarial Network Training Via Dynamic Borderline Softening Mechanism

Multi-generator GAN learning disconnected manifolds with mutual information

Dual Distribution Matching GAN

Study of Prevention of Mode Collapse in Generative Adversarial Network (GAN)

Reducing Mode Collapse With Monge–Kantorovich Optimal Transport for Generative Adversarial Networks

STM-GAN: Sequentially Trained Multiple Generators for Mitigating Mode Collapse

Towards the Gradient Vanishing, Divergence Mismatching and Mode Collapse of Generative Adversarial Nets

InvertGAN: Reducing Mode Collapse with Multi-Dimensional Gaussian Inversion

Combating Mode Collapse via Offline Manifold Entropy Estimation

Combating Mode Collapse in GANs via Manifold Entropy Estimation

On Mode Collapse in Generative Adversarial Networks

VirtualGAN: Reducing Mode Collapse in Generative Adversarial Networks Using Virtual Mapping

Overcoming Mode Collapse with Adaptive Multi Adversarial Training

BEGAN v3: Avoiding Mode Collapse in GANs Using Variational Inference

Shared Loss between Generators of GANs

Mode Seeking Generative Adversarial Networks for Diverse Image Synthesis

Antimode collapse generative adversarial networks