Abstract:Super-resolution systems refer to computer-based systems designed to enhance the quality of images or video by producing high-resolution renditions from low-resolution counterparts using computational algorithms and technologies. Various methods and techniques have been used in development of super-resolution systems. The development of Convolution Neural Networks (CNNs) and the Deep Learning (DL) methods have outperformed traditional methods. However, as models become increasingly deeper with wider receptive fields, the number of parameters significantly increases. While this often results in better performance, it renders these models impractical for real-life scenarios such as smartphones or other mobile systems. Currently, most proposed methods with higher perceptual quality demand a substantial amount of time to process a single image, even on powerful hardware like NVIDIA GPUs. Such computationally expensive models are not cost-effective for real-world application scenarios. Optimization is needed to reduce the computational costs and memory requirements to enhance their suitability for less powerful hardware configurations. In this work, we propose an efficient binary neural network architecture, ResBinESPCN, designed for image super-resolution. In our design, we improved the energy efficiency of the architecture through algorithmic and hardware-level optimizations. These optimizations not only enhance computational efficiency and reduce memory consumption but also achieve effective image super-resolution in resource-constrained environments. Our experimental validation highlights the effectiveness of this network structure and includes ablation studies on models with varying data bit widths. Hardware analysis substantiates the efficiency and real-time capabilities of this model. Additionally, deploying the model on FPGA using FINN demonstrates its low hardware resource usage and low power consumption.

HDSuper: High-Quality and High Computational Utilization Edge Super-Resolution Accelerator With Hardware-Algorithm Co-Design Techniques

A High-Performance Accelerator for Real-Time Super-Resolution on Edge FPGAs

ESSR: An 8K@30FPS Super-Resolution Accelerator With Edge Selective Network

A Convolutional Neural Network Accelerator Architecture with Fine-Granular Mixed Precision Configurability.

A 28-nm Computing-in-Memory-Based Super-Resolution Accelerator Incorporating Macro-Level Pipeline and Texture/Algebraic Sparsity

ACNPU: A 4.75TOPS/W 1080P@30FPS Super Resolution Accelerator with Decoupled Asymmetric Convolution

Single Image Super-Resolution Via the Implementation of the Hardware-Friendly Sparse Coding

CNN Acceleration With Hardware-Efficient Dataflow for Super-Resolution

Efficient Super-Resolution System with Block-Wise Hybridization and Quantized Winograd on FPGA

FPGA Implementation of Feature Detection Algorithm Based on High Level Synthesis

Efficient FPGA Binary Neural Network Architecture for Image Super-Resolution

UArch: A Super-Resolution Processor with Heterogeneous Triple-Core Architecture for Workloads of U-Net Networks

A High-Performance FPGA-Based Depthwise Separable Convolution Accelerator

Hundred-Kilobyte Lookup Tables for Efficient Single-Image Super-Resolution

A Weight-Reload-Eliminated Compute-in-Memory Accelerator for 60 fps 4K Super-Resolution

Myocarditis: A clinical entity that can benefit from noninvasive imaging

Hardware Implementation of Depthwise Separable Convolution Neural Network

FPGA-Based Real-Time Super-Resolution System for Ultra High Definition Videos

HISP: Heterogeneous Image Signal Processor Pipeline Combining Traditional and Deep Learning Algorithms Implemented on FPGA

An FPGA-Based Residual Recurrent Neural Network for Real-Time Video Super-Resolution

A High-Performance Accelerator for Super-Resolution Processing on Embedded GPU