A novel measure to evaluate generative adversarial networks based on direct analysis of generated images

Shuyue Guan,Murray Loew
DOI: https://doi.org/10.1007/s00521-021-06031-5
2021-05-15
Neural Computing and Applications
Abstract:The generative adversarial network (GAN) is a state-of-the-art technique in the field of deep learning. A number of recent papers address the theory and applications of GANs in various fields of image processing. Fewer studies, however, have directly evaluated GAN outputs. Those that have been conducted focused on using classification performance, <i>e.g.</i>, <i>inception score</i> (IS) and statistical metrics, <i>e.g.</i>, <i>Fréchet inception distance</i> (FID). Here, we consider a fundamental way to evaluate GANs by directly analyzing the images they generate, instead of using them as inputs to other classifiers. We characterize the performance of a GAN as an image generator according to three aspects: 1) creativity: non-duplication of the real images; 2) inheritance: generated images should have the same style, which retains key features of the real images; and 3) diversity: generated images are different from each other. A GAN should not generate a few different images repeatedly. Based on the three aspects of ideal GANs, we have designed the <i>likeness score</i> (LS) to evaluate GAN performance and have applied it to evaluate several typical GANs. We compared our proposed measure with two commonly used GAN evaluation methods: IS and FID, and four additional measures. Furthermore, we discuss how these evaluations could help us deepen our understanding of GANs and improve their performance. The code can be found in author's website linked up with the ORCID: <a href="https://orcid.org/0000-0002-3779-9368">https://orcid.org/0000-0002-3779-9368</a>.
computer science, artificial intelligence
What problem does this paper attempt to address?