ResCCFusion: Infrared and Visible Image Fusion Network Based on ResCC Module and Spatial Criss-Cross Attention Models

Zhang Xiong, Xiaohui Zhang, Hongwei Han, Qingping Hu
DOI: https://doi.org/10.1016/j.infrared.2023.104962
IF: 2.997
2023-01-01
Infrared Physics & Technology
Abstract: We propose an infrared and visible image fusion method based on the ResCC module and spatial criss-cross attention models. The method adopts an auto-encoder structure consisting of an encoder network, fusion layers, and a decoder network. The encoder network has a convolution layer and three ResCC blocks with dense connections. Each ResCC block extracts multi-scale features from the source images without downsampling operations, retaining as many feature details as possible for image fusion. The fusion layer adopts spatial criss-cross attention models, which capture contextual information in both the horizontal and vertical directions. Restricting attention to these two directions also reduces the cost of computing the attention maps. The decoder network consists of four convolution layers designed to reconstruct images from the feature map. Experiments on public datasets show that the proposed method achieves better fusion performance in both objective and subjective evaluations than other advanced fusion methods. The code is available at https://github.com/xiongzhangzzz/ResCCFusion.
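The abstract states that criss-cross attention looks only along the horizontal and vertical directions and that this reduces the cost of the attention maps. The sketch below illustrates that general idea in plain Python; it is not the authors' implementation (see their repository for that). It assumes, for simplicity, that queries, keys, and values are the raw feature vectors with no learned projections, and the function names `criss_cross_attention` and `attention_cost` are hypothetical.

```python
import math


def criss_cross_attention(feat):
    """Apply criss-cross attention to an H x W grid of feature vectors.

    feat is a nested list: feat[i][j] is a list of C floats.
    Assumption: queries, keys and values are the features themselves
    (no learned projections), so the sketch stays dependency-free.
    """
    h, w = len(feat), len(feat[0])
    out = [[None] * w for _ in range(h)]
    for i in range(h):
        for j in range(w):
            q = feat[i][j]
            # Criss-cross path: every position in the same row, plus every
            # position in the same column (the centre is counted once).
            path = [(i, k) for k in range(w)]
            path += [(k, j) for k in range(h) if k != i]
            # Dot-product similarity between the query and each key.
            scores = [sum(a * b for a, b in zip(q, feat[r][c])) for r, c in path]
            # Numerically stable softmax over the H + W - 1 scores.
            top = max(scores)
            exps = [math.exp(s - top) for s in scores]
            total = sum(exps)
            weights = [e / total for e in exps]
            # Weighted sum of the values along the path, channel by channel.
            out[i][j] = [
                sum(wt * feat[r][c][ch] for wt, (r, c) in zip(weights, path))
                for ch in range(len(q))
            ]
    return out


def attention_cost(h, w):
    """Return query-key pair counts as (criss-cross, full self-attention)."""
    n = h * w
    return n * (h + w - 1), n * n
```

For a 64 x 64 feature map, `attention_cost(64, 64)` gives 520,192 query-key pairs for the criss-cross path against 16,777,216 for full self-attention, roughly 32 times fewer, which is the kind of saving the abstract refers to.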