Abstract:Infrared-visible image fusion is of great value in many applications due to their highly complementary information. However, it is hard to obtain high-quality fused image for current fusion algorithms. In this paper, we reveal an underlying deficiency in current fusion framework limiting the quality of fusion, i.e., the visual features used in the fusion can be easily affected by external physical conditions (e.g., the characteristics of different sensors and environmental illumination), indicating that those features from different sources have not been ensured to be fused on a consistent basis during the fusion. Inspired by biological vision, we derive a framework that transforms the image intensities into the visual response space of human visual system (HVS), within which all features are fused in the same perceptual state, eliminating the external physical factors that may influence the fusion process. The proposed framework incorporates some key characteristics of HVS that facilitate the simulation of human visual response in complex scenes, and is built on a new variant of multiscale decomposition, which can accurately localize image structures of different scales in visual-response simulation and feature fusion. A bidirectional saliency aggregation is proposed to fuse the perceived contrast features within the visual response space of HVS, along with an adaptive suppression of noise and intensity-saturation in this space prior to the fusion. The final fused image is obtained by transforming the fusion results in human visual response space back to the physical domain. Experiments demonstrate the significant improvement of fusion quality brought about by the proposed method.

Non-linear and selective fusion of cross-modal images

A Cross-Modal Image Fusion Method Guided by Human Visual Characteristics

Fusion Of Infrared And Visible Light Images Based On Nonsubsampled Shearlet Transform

Fusion of infrared and visual images through multiscale hybrid unidirectional total variation

Fusion of Visible and Infrared Images Using Saliency Analysis and Detail Preserving Based Image Decomposition

Multi-Frame Image Fusion Method Combining Spatial-Temporal Saliency Detection and Nsct

Cross-Modal Image Fusion Theory Guided by Subjective Visual Attention

CrossFuse: A Novel Cross Attention Mechanism based Infrared and Visible Image Fusion Approach

The Realistic Fusion of Multi-spectral Images

A Cross-scale Iterative Attentional Adversarial Fusion Network for Infrared and Visible Images

Visible and Infrared Image Fusion Based on Attention and Multiscale Residuals

A Multi-Stage Visible and Infrared Image Fusion Network Based on Attention Mechanism

Multi-scale Convolutional Neural Networks and Saliency Weight Maps for Infrared and Visible Image Fusion

Multi-scale unsupervised network for infrared and visible image fusion based on joint attention mechanism

Fusion of Infrared and Visible Images based on Spatial-Channel Attentional Mechanism

A perceptual framework for infrared-visible image fusion based on multiscale structure decomposition and biological vision

Integrating Parallel Attention Mechanisms and Multi-Scale Features for Infrared and Visible Image Fusion

Infrared and Visible Image Fusion Method Based on Hierarchical Attention Mechanism

Infrared and Visible Image Fusion with Hierarchical Human Perception

Multi-modal Image Fusion with the Hybrid ℓ0ℓ1 Layer Decomposing and Multi-Directional Filter Banks