Non-local self-attention network for image super-resolution

Kun Zeng,Hanjiang Lin,Zhiqiang Yan,Jinsheng Fang,Taotao Lai,Lin, Hanjiang,Lai, Taotao
DOI: https://doi.org/10.1007/s10489-024-05343-y
IF: 5.3
2024-04-21
Applied Intelligence
Abstract:The utilization of self-attention mechanisms in Transformer-based methods has shown great potential in addressing the image super-resolution (SR) task by capturing long-range dependencies. However, many existing Transformer-based methods for SR extract features locally within a small window and rely on shifted window self-attention to gradually incorporate long-range dependencies. These methods may not effectively exploit non-local image information for SR. To overcome this limitation, we propose a novel non-local self-attention (NLSA) mechanism that directly models non-local dependencies. Firstly, NLSA utilizes locality-sensitive hashing to identify similar pixel-wise features with minimal computational cost. Next, a pixel-shuffling operation is applied to gather similar features within the same window. This pixel-shuffling technique effectively expands the receptive field beyond the window size. Furthermore, we introduce a simplified window self-attention (SiWSA) that operates within each window to capture intrinsic long-term dependencies among the shuffled features, regardless of the position information. Finally, after the SiWSA calculation, the features are shuffled back to their original positions to maintain data consistency. This overall NLSA mechanism enables the capture of non-local information without the need for excessively deep networks to enlarge the receptive field. Based on NLSA, we propose a non-local self-attention network (NLSAN) designed explicitly for the SR task. Through extensive experimental evaluations, we demonstrate the superior performance of NLSAN compared to several state-of-the-art SR methods in quantitative and qualitative assessments. The code of the proposed method is available at https://github.com/zengkun301/NLSAN.
computer science, artificial intelligence
What problem does this paper attempt to address?