Understanding GANs: fundamentals, variants, training challenges, applications, and open problems
Zeeshan Ahmad,Zain ul Abidin Jaffri,Meng Chen,Shudi Bao,Jaffri, Zain ul Abidin
DOI: https://doi.org/10.1007/s11042-024-19361-y
IF: 2.577
2024-05-15
Multimedia Tools and Applications
Abstract:Generative adversarial networks (GANs), a novel framework for training generative models in an adversarial setup, have attracted significant attention in recent years. The two opposing neural networks of the GANs framework, i.e., a generator and a discriminator, are trained simultaneously in a zero-sum game, where the generator generates images to fool the discriminator that is trained to discriminate between real and synthetic images. In this paper, we provide a comprehensive review about the recent developments in GANs. Firstly, we introduce various deep generative models, basic theory and training mechanism of GANs, and the latent space. We further discuss several representative variants of GANs. Although GANs have been successfully utilized in various applications, they are known to be highly unstable to train. Generally, there is a lack of understanding as to how GANs converge. We briefly discuss the sources of instability and convergence issues in GANs from the perspectives of statistics, game theory and control theory, and describe several techniques for their stable training. Evaluating GANs has been a challenging task, as there is no consensus yet reached on which measure is more suitable for model comparison. Therefore, we provide a brief discussion on quantitative and qualitative evaluation measures for GANs. Then, we conduct several experiments to compare representative GANs variants based on these evaluation metrics. Furthermore, the application areas of GANs are briefly discussed. Finally, we outline several important open issues and future research trends in GANs.
computer science, information systems, theory & methods,engineering, electrical & electronic, software engineering