Tackling mode collapse in multi-generator GANs with orthogonal vectors

Wei Li, Li Fan, Zhenyu Wang, Chao Ma, Xiaohui Cui
DOI: https://doi.org/10.1016/j.patcog.2020.107646
IF: 8
2021-02-01
Pattern Recognition
Abstract: Generative Adversarial Networks (GANs) have been widely used to generate realistic-looking instances. However, training a robust GAN is non-trivial because of mode collapse. Although many GAN variants have been proposed to overcome this problem, they have limitations: existing approaches either generate identical instances or result in negative gradients during training. In this paper, we propose a new approach to training GANs that overcomes mode collapse by employing a set of generators, an encoder and a discriminator. A new minimax formula is proposed to train all components simultaneously, in a similar spirit to the vanilla GAN. An orthogonal vector strategy guides the multiple generators to learn different information in a complementary manner; we therefore term our approach Multi-Generator Orthogonal GAN (MGO-GAN). Specifically, the synthetic data produced by the generators are fed into the encoder to obtain feature vectors. An orthogonal value is calculated between any two feature vectors and faithfully reflects the correlation between them. This correlation indicates how much the information learnt by the generators differs: the lower the orthogonal value, the more different the information the generators learn. We minimize the orthogonal value together with the generator loss through back-propagation, integrating it with the original generator loss to jointly update the corresponding generator's parameters. We conduct extensive experiments on the MNIST, CIFAR10 and CelebA datasets to demonstrate the significant performance improvement of MGO-GAN in terms of generated data quality and diversity at different resolutions.
Computer Science, Artificial Intelligence; Engineering, Electrical & Electronic
What problem does this paper attempt to address?
The problem that this paper attempts to solve is mode collapse in the training of Generative Adversarial Networks (GANs), which makes training a robust GAN non-trivial. According to the abstract, existing GAN variants that target this problem either generate identical instances or result in negative gradients during training. Specifically, the paper makes the following contributions:
1. **Multi-generator architecture**: The paper proposes MGO-GAN (Multi-Generator Orthogonal GAN), which combines a set of generators, an encoder and a discriminator, and trains all components simultaneously with a new minimax formula in a similar spirit to the vanilla GAN.
2. **Orthogonal vector strategy**: The synthetic data produced by the generators are fed into the encoder to obtain feature vectors, and an orthogonal value is calculated between any two feature vectors. This value reflects the correlation between the vectors: the lower it is, the more different the information the generators have learnt. The orthogonal value is minimized together with the generator loss through back-propagation, so that it jointly updates the corresponding generator's parameters.
3. **Experimental evaluation**: The paper reports experiments on the MNIST, CIFAR10 and CelebA datasets that are intended to demonstrate improved quality and diversity of the generated data at different resolutions.
In summary, the main objective of this paper is to overcome mode collapse by guiding multiple generators to learn different information in a complementary manner.
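The orthogonal-value idea described in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact formula, so the code below assumes one plausible form: the mean squared cosine similarity over all pairs of feature vectors, which is zero when the vectors are mutually orthogonal. The function names, the `weight` parameter, and the way the penalty is added to the generator loss are all hypothetical and are not taken from the paper.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def orthogonal_value(features):
    """Assumed penalty: mean squared cosine over all pairs of feature vectors.

    `features` holds one encoder feature vector per generator (at least two).
    The result is 0.0 for mutually orthogonal vectors and 1.0 for parallel ones.
    """
    n = len(features)
    pairs = [(i, j) for i in range(n) for j in range(i + 1, n)]
    total = sum(cosine(features[i], features[j]) ** 2 for i, j in pairs)
    return total / len(pairs)


def generator_objective(gen_loss, features, weight=1.0):
    """Hypothetical combined objective: generator loss plus weighted penalty."""
    return gen_loss + weight * orthogonal_value(features)


if __name__ == "__main__":
    # Toy feature vectors standing in for encoder outputs of three generators.
    feats = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [1.0, 1.0, 0.0]]
    print(orthogonal_value(feats))
    print(generator_objective(0.7, feats, weight=0.5))
```

In an actual training loop the same quantity would be computed on batched tensors inside an automatic-differentiation framework, so that the penalty contributes gradients to each generator alongside its adversarial loss. The squared cosine is used here only because it is symmetric and bounded; the paper's own definition may differ.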