A Lightweight Distillation Recurrent Convolution Network on FPGA for Real-Time Video Super-Resolution

Zhaowen Zheng, Yuqiao Huang, Dihu Chen
DOI: https://doi.org/10.1007/s00530-024-01528-0
IF: 3.9
2024-01-01
Multimedia Systems
Abstract: In image super-resolution (SR) applications based on field-programmable gate arrays (FPGAs), depthwise separable convolution is widely used. However, existing network designs oversimplify the structures used for deep feature extraction in order to conserve on-chip memory, which compromises network performance, and they still consume relatively large amounts of hardware resources. This study adopts a hardware-software co-design approach, proposing a hardware-friendly algorithm based on a recurrent convolution neural network (RCNN) and implementing it on an FPGA. The algorithm employs a lightweight information multi-distillation block (LIMDB) to deeply extract and distill feature information, enlarging the network’s receptive field and improving its feature-extraction capability. At the same time, the network reduces the dimensions of the hidden state, which improves performance and decreases data transmission. Experimental results demonstrate that our lightweight deep recurrent convolution network (LDRCN) significantly outperforms other methods on common datasets. At the hardware level, we designed an efficient pipelining structure that combines skip-connection and line-buffer shared storage with a lossless segmentation computational strategy to reduce on-chip memory usage. Additionally, a method for finding mixed-precision quantization bit widths was designed to significantly reduce computational resource consumption while preserving accuracy. The synthesis results on the ZYNQ XCZU9EG platform show that, compared to similar RNN-based works, the computation speed increased by 24.3
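As background on why depthwise separable convolution is attractive on resource-limited hardware, the sketch below compares the weight count of a standard convolution with that of a depthwise convolution followed by a 1 x 1 pointwise convolution. It is a general illustration, not the paper's network: the layer sizes (a 3 x 3 kernel with 32 input and 32 output channels) and all function names are assumptions chosen for the example, and biases are ignored.

```python
def standard_conv_params(k, c_in, c_out):
    """Weights in a standard k x k convolution (bias ignored)."""
    return k * k * c_in * c_out


def depthwise_separable_params(k, c_in, c_out):
    """Weights in a k x k depthwise conv followed by a 1 x 1 pointwise conv."""
    depthwise = k * k * c_in      # one k x k filter per input channel
    pointwise = c_in * c_out      # 1 x 1 conv that mixes channels
    return depthwise + pointwise


def macs(params, height, width):
    """Multiply-accumulates for a stride-1, same-padded layer:
    every weight is applied once per output pixel."""
    return params * height * width


if __name__ == "__main__":
    # Hypothetical layer sizes, chosen only for illustration.
    k, c_in, c_out = 3, 32, 32
    std = standard_conv_params(k, c_in, c_out)
    dws = depthwise_separable_params(k, c_in, c_out)
    print("standard:", std, "separable:", dws, "ratio:", dws / std)
```

For these assumed sizes the separable layer needs 1312 weights against 9216, a ratio of 1/c_out + 1/k^2, and the same ratio applies to multiply-accumulate counts at a fixed output resolution.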