CAFNET: Cross-Attention Fusion Network for Infrared and Low Illumination Visible-Light Image

Zhou, Xiaoling
DOI: https://doi.org/10.1007/s11063-022-11125-9
IF: 2.565
2022-12-31
Neural Processing Letters
Abstract:Due to the sampling and pooling operations, deep learning-based infrared and visible-light image fusion methods often result into detail loss problem, especially under low illumination. Therefore, we propose a novel cross-attention fusion network (CAFNET) to fuse infrared and low illumination visible-light image merely based on the first layer of the pre-trained VGG16 network. Firstly, features with the same size of source images are extracted by the first layer of pre-trained VGG16 respectively. Then, based on the extracted features, cross attention is calculated to distinguish the differences between infrared and visible-light image, and the spatial attention is computed to reflect the characteristics of infrared and visible-light image. After that, weight maps are gained through modulating the cross attention and spatial attention, based on which the source images are pre-fused. Finally, Gaussian blur-based details injection is performed to further enhance the details of the pre-fused image. Experiment results show that, compared with the traditional multi-scale and state-of-the-art deep learning-based fusion methods, our approach can achieve better performance both in subjective and objective evaluations.
computer science, artificial intelligence
What problem does this paper attempt to address?