The Hybrid Attention Single Image Super Resolution Network with Conditional Random Field

Jiayu Wei,Yanfeng Li,Houjin Chen,Jia Sun,Xu Yao
DOI: https://doi.org/10.1117/12.3021083
2024-01-01
Abstract:Recently, Transformer-based methods have achieved excellent results in various computer vision tasks, including Single Image Super-Resolution (SISR). In SwinIR, the mechanism of cross-window connection and local self-attention of Swin Transformer are introduced into the SISR task, achieving breakthrough improvements. However, the local self-attention mechanism of Swin Transformer has a limited spatial range of input pixels, which limits the ability of the super-resolution network to extract features in a wide range. Aiming at this problem, an enhanced CNN and Transformer hybrid module is designed for feature extraction by combining self-attention, spatial attention and channel attention. Taking advantage of their complementary strengths, the range of activated pixels is expanded while still maintaining a strong capability for local feature characterization. In addition, simply extending the activation pixel range without constraints is not conducive to reconstruction. Aiming at this problem, the Neural Window Fully-connected Conditional Random Fields (NeW FC-CRFs) are integrated for feature fusion. The shallow features are inputted into NeW FC-CRFs along with deep features, allowing for the utilization of multi-level information during the fusion process. In summary, we propose the Hybrid Attention Super Resolution Network with Conditional Random Field (HANCRF). Extensive experiments show that HANCRF achieves competitive results with a small number of parameters.
What problem does this paper attempt to address?