Infrared and visible image fusion based on double fluid pyramids and multi-scale gradient residual block

Shan Pang,Hongtao Huo,Xin Yang,Jing Li,Xiaowen Liu
DOI: https://doi.org/10.1016/j.infrared.2023.104702
2023-05-04
Abstract:Infrared and visible image fusion can capture multi-modal features and generate an informative fused image. However, most existing encoder and decoder-based fusion methods cannot fully utilize the output features of each layer in the deep learning network. In most cases, there are no elaborate modules to fully extract the detail information of multi-scale feature maps. To this end, this work proposes a novel encoder and decoder-based infrared and visible image fusion network named PG-Fusion. In feature extraction part, the double fluid pyramids are designed to integrate and preserve features of each layer of the encoder in two independent data flows. Multi-layer structure and injected attention mechanisms bridge the gaps between low-level features and high-level features and promote the interaction of multi-grained features. Specifically, a multi-scale gradient residual block is devised to preserve enough fine-grained textures and details from source images, which can facilitate the extraction of discriminative information and boost the description ability of the network. In addition, sliding window comparison strategy based intensity loss function and gradient loss function are utilized to impel our network towards a final optimal solution. Extensive qualitative and quantitative experiment results illustrate that PG-Fusion achieves better fusion performance compared with eleven typical algorithms.
optics,physics, applied,instruments & instrumentation
What problem does this paper attempt to address?