Abstract:When infrared targets are located at the edge of an image or when the targets are relatively small, the standard infrared and visible image fusion algorithm becomes a major problem because it relies on manually designed strategies and low-level image statistics for saliency detection. To address this issue, SeGFuison is proposed. It is a semantic saliency guided infrared and visible image fusion method composed of an autoencoder, a fusion layer, and a Semantic Segmentation-based Deep Saliency model (SSDS). It focuses on the structural information of images and generates saliency maps at the feature level, so that infrared targets can be extracted more accurately, thereby avoiding the introduction of artifacts and noise in fusion images. Incorporating saliency maps dynamically generated by SSDS, our approach effectively guides the training process of the fusion model. This strategic utilization guarantees that the resulting fused image maintains a saliency map that closely resembles that of the original infrared image. Furthermore, saliency maps are employed to partition images into distinct regions, namely target areas and background areas. This segmentation enables the design of distinct loss functions tailored to the unique characteristics of each area. As a result, our approach ensures the fusion of images preserves both salient targets and intricate background details, thus upholding a comprehensive depiction of fusion information. Through rigorous experimentation conducted on widely recognized public datasets including TNO, RoadScene, and MSRS, our algorithm has exhibited distinct advantages over contemporary state-of-the-art algorithms, both in terms of objective metrics and subjective evaluations. Notably, SeGFusion attains remarkable scores on key indicators such as FMI, VIF, and SD, affirming its superiority. Furthermore, it excels in subjective assessments, producing fused images of unparalleled clarity. The obtained experimental results compellingly showcase the inherent potential of our proposed algorithm, thereby substantiating its viability for diverse applications within fields such as infrared instruments and equipment.

SPFusion: A multi-task semantic perception infrared and visible light fusion method with quality assessment

Fusion of Infrared and Visible Images Via Multi-Layer Convolutional Sparse Representation

PIAFusion: A progressive infrared and visible image fusion network based on illumination aware

SSPFusion: A Semantic Structure-Preserving Approach for Infrared and Visible Image Fusion

Rethinking the necessity of image fusion in high-level vision tasks: A practical infrared and visible image fusion network based on progressive semantic injection and scene fidelity

Beyond Night Visibility: Adaptive Multi-Scale Fusion of Infrared and Visible Images

SFDFusion: An Efficient Spatial-Frequency Domain Fusion Network for Infrared and Visible Image Fusion

ASFusion: Adaptive visual enhancement and structural patch decomposition for infrared and visible image fusion

Image fusion in the loop of high-level vision tasks: A semantic-aware real-time infrared and visible image fusion network

A Multi-Stage Visible and Infrared Image Fusion Network Based on Attention Mechanism

SFCFusion: Spatial–Frequency Collaborative Infrared and Visible Image Fusion

SeGFusion: A semantic saliency guided infrared and visible image fusion method

MVSFusion: infrared and visible image fusion method for multiple visual scenarios

Adaptive low light visual enhancement and high-significant target detection for infrared and visible image fusion

FAFusion: Learning for Infrared and Visible Image Fusion via Frequency Awareness

DCFusion: A Dual-Frequency Cross-Enhanced Fusion Network for Infrared and Visible Image Fusion.

SCFusion: Infrared and Visible Fusion Based on Salient Compensation

HSFusion: A high-level vision task-driven infrared and visible image fusion network via semantic and geometric domain transformation

A Semantic-Aware and Multi-Guided Network for Infrared-Visible Image Fusion

SIGFusion: Semantic Information-Guided Infrared and Visible Image Fusion

An efficient frequency domain fusion network of infrared and visible images