Reviewing FID and SID Metrics on Generative Adversarial Networks

Ricardo de Deijn, Aishwarya Batra, Brandon Koch, Naseef Mansoor, Hema Makkena

DOI: https://doi.org/10.5121/csit.2024.140208

2024-02-06

Abstract: The growth of generative adversarial network (GAN) models has expanded the capabilities of image processing and gives numerous industries the technology to produce realistic image transformations. However, because the field is recently established, new evaluation metrics are still emerging that can further this research. Previous research has shown the Fréchet Inception Distance (FID) to be an effective metric for testing image-to-image GANs in real-world applications. Signed Inception Distance (SID), a metric introduced in 2023, expands on FID by allowing unsigned distances. This paper trains Pix2Pix and CycleGAN models on public datasets of façades, cityscapes, and maps. After training, the models are evaluated on both inception distance metrics, which measure the generative performance of the trained models. Our findings indicate that SID is an efficient and effective metric that can complement, or even exceed, the ability shown by FID for image-to-image GANs.

Computer Vision and Pattern Recognition, Image and Video Processing

What problem does this paper attempt to address?

### Problems Addressed by the Paper

This paper explores the application of Generative Adversarial Networks (GANs) in image processing and compares two evaluation metrics, the established Fréchet Inception Distance (FID) and the newer Signed Inception Distance (SID), as measures of GAN performance in image-to-image translation tasks.

#### Main Issues:

1. **Effectiveness of Evaluation Metrics**: Investigating how FID and SID can better assess the performance of GAN models in image-to-image translation tasks.
2. **Diversity of GAN Models**: Addressing issues such as mode collapse, instability, and lack of diversity in current GAN models.
3. **Performance on Different Datasets**: Evaluating the performance of Pix2Pix and CycleGAN models on various public datasets (e.g., building façades, cityscapes, and maps) and comparing their scores under FID and SID.

#### Specific Objectives:

- Compare the effectiveness of FID and SID on Pix2Pix and CycleGAN models across different datasets.
- Explore the impact of different numbers of training epochs on the performance of GAN models.
- Analyze the effect of different dataset sizes on the performance of GAN models.

Through these studies, the paper aims to identify more effective evaluation methods for GAN models in image-to-image translation tasks. In particular, it suggests that SID, as a newer metric, may better capture the diversity of generated images and their differences from real images, and so may complement or improve on the assessment FID provides.
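To make the FID side of the comparison concrete, the following is a minimal sketch of the standard Fréchet distance between two Gaussians fitted to feature vectors, $\lVert \mu_r - \mu_g \rVert^2 + \mathrm{Tr}\left(\Sigma_r + \Sigma_g - 2(\Sigma_r \Sigma_g)^{1/2}\right)$. It is not the paper's implementation. In practice the features come from a pretrained Inception-v3 network; here random arrays stand in for them, and the function name `frechet_distance` and the feature dimension are assumptions chosen for illustration. SID is not sketched because its definition is not given in this text.

```python
import numpy as np


def frechet_distance(feats_real, feats_gen):
    """Fréchet distance between Gaussians fitted to two feature sets.

    Each argument is an array of shape (n_samples, n_features), n_features >= 2.
    """
    mu_r = feats_real.mean(axis=0)
    mu_g = feats_gen.mean(axis=0)
    cov_r = np.cov(feats_real, rowvar=False)
    cov_g = np.cov(feats_gen, rowvar=False)

    # Tr((cov_r cov_g)^{1/2}) equals the sum of the square roots of the
    # eigenvalues of cov_r @ cov_g, which are real and non-negative for
    # positive semi-definite inputs. Clipping guards against tiny negative
    # values caused by floating-point error.
    eigvals = np.linalg.eigvals(cov_r @ cov_g)
    trace_sqrt = np.sqrt(np.clip(eigvals.real, 0.0, None)).sum()

    diff = mu_r - mu_g
    return float(diff @ diff + np.trace(cov_r) + np.trace(cov_g) - 2.0 * trace_sqrt)


# Hypothetical stand-in features (real FID uses Inception-v3 activations).
rng = np.random.default_rng(0)
real = rng.normal(size=(500, 4))
shift = np.array([1.0, 2.0, 0.0, 0.0])
generated = real + shift  # same covariance, mean moved by `shift`

same_score = frechet_distance(real, real)          # approximately 0
shifted_score = frechet_distance(real, generated)  # approximately |shift|^2 = 5
print(same_score, shifted_score)
```

Because the shifted set has the same covariance as the original, the covariance terms cancel and the score reduces to the squared distance between the means, which makes the example easy to check by hand. Lower scores indicate that the generated features are statistically closer to the real ones.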

Using Skew to Assess the Quality of GAN-generated Image Features

Rethinking FID: Towards a Better Evaluation Metric for Image Generation

Compound Frechet Inception Distance for Quality Assessment of GAN Created Images

On Aliased Resizing and Surprising Subtleties in GAN Evaluation

Evaluating Text-to-Image GANs Performance: A Comparative Analysis of Evaluation Metrics

F?D: On understanding the role of deep feature spaces on face generation evaluation

Improving Sample-based Evaluation for Generative Adversarial Networks

A Study on the Evaluation of Generative Models

An Improved Evaluation Framework for Generative Adversarial Networks

The Role of ImageNet Classes in Fréchet Inception Distance

On the Evaluation of Generative Adversarial Networks By Discriminative Models

Feature Extraction for Generative Medical Imaging Evaluation: New Evidence Against an Evolving Trend

Effectively Unbiased FID and Inception Score and where to find them

A novel measure to evaluate generative adversarial networks based on direct analysis of generated images

Fréchet Denoised Distance: Enhancing Plausibility Evaluation for Generated Designs with Denoising Autoencoder

On the Distributed Evaluation of Generative Models

SIDBench: A Python Framework for Reliably Assessing Synthetic Image Detection Methods

An Optimism-based Approach to Online Evaluation of Generative Models

Quality Evaluation of GANs Using Cross Local Intrinsic Dimensionality