Sparse-Aware Transformer for Single Image Super-Resolution

Qingtang Ding, Jungang Yang
DOI: https://doi.org/10.1109/cbase60015.2023.10439116
2023-01-01
Abstract: Continued advances in deep learning have further improved the performance of single image super-resolution (SR). Recently, researchers have introduced the Transformer architecture to single image SR because of its significant advantage in capturing long-range correlations. However, a Transformer network densely samples tokens from the original image and computes multi-head self-attention (MHSA) over all of them. Because SR is a sparse task, SR networks focus mainly on the texture regions and edges of an image when learning the nonlinear mapping between low-resolution (LR) and high-resolution (HR) images. Tokens sampled from flat image regions are therefore less informative in a Transformer-based SR network, and they cause a large amount of redundant computation. To improve the efficiency of Transformer-based SR networks, we exploit the sparsity of the SR task while still effectively modeling long-range dependencies. To this end, we design a Sparse-Aware Transformer (SAT) for single image SR. Experimental results show that SAT reduces computation and achieves competitive SR performance on SR benchmark datasets.
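The abstract does not spell out how SAT selects tokens, so the following is only a minimal sketch of the general idea it describes: score tokens by how much texture they contain, then run self-attention over the informative subset instead of all tokens. The function names (`patchify`, `select_informative`, `sparse_self_attention`), the variance-based score, the `keep_ratio` parameter, and the single-head attention without learned projections are all assumptions made for illustration and are not taken from the paper.

```python
import numpy as np

def patchify(img, patch):
    """Split an (H, W) image into non-overlapping patch tokens, shape (N, patch*patch)."""
    h, w = img.shape
    gh, gw = h // patch, w // patch
    x = img[:gh * patch, :gw * patch].reshape(gh, patch, gw, patch)
    return x.transpose(0, 2, 1, 3).reshape(gh * gw, patch * patch)

def select_informative(tokens, keep_ratio):
    """Return sorted indices of the tokens with the highest intensity variance.

    Variance is a hypothetical stand-in for whatever sparsity measure SAT uses.
    """
    k = max(1, int(round(keep_ratio * len(tokens))))
    scores = tokens.var(axis=1)
    return np.sort(np.argsort(scores)[-k:])

def sparse_self_attention(tokens, idx):
    """Single-head attention over the selected tokens only; the rest pass through."""
    x = tokens[idx]
    logits = x @ x.T / np.sqrt(x.shape[1])
    logits -= logits.max(axis=1, keepdims=True)  # numerical stability
    weights = np.exp(logits)
    weights /= weights.sum(axis=1, keepdims=True)
    out = tokens.copy()
    out[idx] = weights @ x
    return out

# A flat 16x16 image with a single textured 4x4 patch (token index 6).
rng = np.random.default_rng(0)
img = np.full((16, 16), 0.5)
img[4:8, 8:12] = rng.random((4, 4))

tokens = patchify(img, 4)                       # 16 tokens of length 16
idx = select_informative(tokens, keep_ratio=0.25)
out = sparse_self_attention(tokens, idx)
```

With 16 tokens and `keep_ratio=0.25`, the attention matrix in this sketch is 4×4 rather than 16×16, which illustrates where the savings from skipping flat regions would come from.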