Abstract:Advancements in deep learning algorithms and high-performance GPU computation have spurred extensive research in image processing. This study focuses on the synthesis of super-resolution images, a technique aimed at generating high-resolution images from low-resolution images that contain various types of degradation and noise. In recent years, the emergence of convolutional neural networks and adversarial generative networks has made it possible to generate higher quality images. In this research, we propose two approaches to enhance the image quality based on a mainstream algorithm ESRGAN, a GAN-based generator and discriminator model architecture. The first approach applies two types of channel attention (SENet and ECA-Net) to the generator. SENet explicitly models the interdependence of convolutional features across channels, thereby enhancing the quality of the representation generated by the network. ECA-Net decomposes the channel attention of SENet to reduce model complexity while preserving performance through appropriate cross-channel interactions. The second approach applies LPIPS to the image evaluation methods of the discriminator. LPIPS serving as an image quality assessment metric, enhances perceptual evaluation by combining feature extraction from a pre-trained neural network using human evaluation data. To assess the effectiveness of our proposed methods, we employed four benchmark datasets for synthesizing super-resolution images. We used two image quality evaluation metrics: NIQE, which evaluates the naturalness of images, and LPIPS, which provides a human-like perceptual evaluation results. Experimental results demonstrate a significant enhancement in image naturalness and perceptual evaluation values compared to previous studies highlighting the effectiveness of the proposed methods.

Improved Image Synthesis with Attention Mechanism for Virtual Scenes Via UAV Imagery

Fine-grained Semantic Constraint in Image Synthesis

SIMGAN: Photo-Realistic Semantic Image Manipulation Using Generative Adversarial Networks.

Remote Sensing Image Synthesis Via Semantic Embedding Generative Adversarial Networks.

Fine-Grained Semantic Image Synthesis with Object-Attention Generative Adversarial Network

Dual Attention GANs for Semantic Image Synthesis

Spatial Fusion GAN for Image Synthesis

Retrieval-based Spatially Adaptive Normalization for Semantic Image Synthesis

GAN for Semantic Image Synthesis With Laplacian Pyramid and Multi-Scale Channel Attention

Semantic Image Synthesis Via Diffusion Models

CreativeSynth: Creative Blending and Synthesis of Visual Arts based on Multimodal Diffusion

SAM-GAN: Self-Attention supporting Multi-stage Generative Adversarial Networks for text-to-image synthesis

Multi-level Gate Feature Aggregation with Spatially Adaptive Batch-Instance Normalization for Semantic Image Synthesis

Face Sketch Synthesis via Semantic-Driven Generative Adversarial Network

Hybrid Attention Driven Text-To-Image Synthesis Via Generative Adversarial Networks

GAN with opposition-based blocks and channel self-attention mechanism for image synthesis

Remote Sensing Image Synthesis Via Graphical Generative Adversarial Networks.

Revisiting the Evaluation of Image Synthesis with GANs

VSG-GAN: A high-fidelity image synthesis method with semantic manipulation in retinal fundus image

Reconciling Semantic Controllability and Diversity for Remote Sensing Image Synthesis with Hybrid Semantic Embedding

CA-ESRGAN: Super-Resolution Image Synthesis Using Channel Attention-Based ESRGAN