Lightweight image super-resolution network based on extended convolution mixer

Garas Gendy,Nabil Sabor,Guanghui He
DOI: https://doi.org/10.1016/j.engappai.2024.108069
IF: 8
2024-02-21
Engineering Applications of Artificial Intelligence
Abstract:The single image super-resolution (SISR) is a computer vision task needed in many real-world applications. There are many methods developed to solve ill-posed SISR problem; however, these methods are based on attention mechanisms that need a large computing processing cost. So, these attention-based models cannot be used in real-world applications that need fast models. Thus, we propose an enhanced convolution mixer (EConvMixer) module to solve this SISR problem by using lower computing convolution layers. The EConvMixer is designed based on utilizing three convolution types, namely the dilated depthwise convolution for increasing the receptive field, the depthwise convolution for mixing spatial locations, and the pointwise convolution for mixing channel locations. Based on using this EConvMixer layer, we build a lightweight extended convolution mixer network (EConvMixN) for SR images. The EConvMixN has the spirit of the transformer model but with a low computational complexity using only convolution layers. It is clear that our model achieves appealing visual quality and reconstruction accuracy. Also, the EConvMixN model is faster than the state-of-the-art results at different SR scales. Moreover, the EConvMixN achieves state-of-the-art runtime in multiple SR scales. Finally, our model improves PSNR compared to CoMoNet-S by 0.12 dB and 0.08 dB for datasets of Set5 and Set14 at the scale of × 3.
automation & control systems,computer science, artificial intelligence,engineering, electrical & electronic, multidisciplinary
What problem does this paper attempt to address?