Abstract:Multimodal data fusion plays an increasingly important role in the field of artificial intelligence . The objective of Infrared and Visible Image Fusion (IVF) is to integrate information from different types of images to enhance the performance of target detection tasks. Meanwhile, object detection technology constitutes a crucial foundation in the field of autonomous driving. However, visible images captured under low illumination often lack important details, resulting in suboptimal fusion results,which in turn affects the accuracy of target detection tasks. We proposed an infrared and visible image fusion method based on adaptive visual enhancement and structural patch decomposition (ASFusion) to address the above issues. First, we design an efficient algorithm based on the camera response model to enhance different exposure matrices, allowing for adaptive enhancement of visible images. Second, we decompose the source infrared and the enhanced visible image into three components: mean intensity, signal structure, and signal intensity using structural patch decomposition (SPD), and then design a new degree of membership curve function to estimate the weight of the average intensity component accurately. The estimation process reduces artifacts and preserves the significance of infrared targets. Third, to achieve a higher contrast in the fusion result, we introduced sharpening operations to enhance the detail layer of both the infrared and visible images. Finally, the fused image is obtained by merging the base and detail layers. Through qualitative and quantitative experimental evaluations, the proposed method outperforms twelve state-of-the-art image fusion methods. Additionally, object detection experiments have demonstrated that our ASFusion exhibits tremendous potential in better serving advanced computer vision tasks. Our code is publicly available at https://github.com/ZhouVMC/ASFusion .

TeRF: Text-driven and Region-aware Flexible Visible and Infrared Image Fusion

DATFuse: Infrared and Visible Image Fusion via Dual Attention Transformer

CMRFusion: A cross-domain multi-resolution fusion method for infrared and visible image fusion

TCCFusion: An Infrared and Visible Image Fusion Method based on Transformer and Cross Correlation

Combining Regional Energy and Intuitionistic Fuzzy Sets for Infrared and Visible Image Fusion

Fusion of Low-Quality Visible and Infrared Images Based on Multi-Level Latent Low-Rank Representation Joint with Retinex Enhancement and Multi-Visual Weight Information

Feature dynamic alignment and refinement for infrared-visible image fusion: Translation robust fusion

FAFusion: Learning for Infrared and Visible Image Fusion via Frequency Awareness

A robust infrared and visible image fusion framework via multi-receptive-field attention and color visual perception

Infrared and visible image fusion based on domain transform filtering and sparse representation

SimpleFusion: A Simple Fusion Framework for Infrared and Visible Images

Infrared and Visible Image Fusion with Hybrid Image Filtering

Adaptive low light visual enhancement and high-significant target detection for infrared and visible image fusion

Infrared and Visual Image Fusion Based on a Local-Extrema-Driven Image Filter

SFCFusion: Spatial–Frequency Collaborative Infrared and Visible Image Fusion

ASFusion: Adaptive visual enhancement and structural patch decomposition for infrared and visible image fusion

Different Input Resolutions and Arbitrary Output Resolution: A Meta Learning-Based Deep Framework for Infrared and Visible Image Fusion

Event-based Visible and Infrared Fusion Via Multi-task Collaboration

SFDFusion: An Efficient Spatial-Frequency Domain Fusion Network for Infrared and Visible Image Fusion

Boosting Target-Level Infrared and Visible Image Fusion with Regional Information Coordination.

Visible and Infrared Image Fusion Based on Attention and Multiscale Residuals