FusionDiff: A unified image fusion network based on diffusion probabilistic models

Zefeng Huang, Shen Yang, Jin Wu, Lei Zhu, Jin Liu
DOI: https://doi.org/10.1016/j.cviu.2024.104011
IF: 4.886
2024-04-13
Computer Vision and Image Understanding
Abstract: This paper introduces FusionDiff, a novel unified end-to-end image fusion network based on diffusion probabilistic models. The proposed method addresses the data dependency issue present in current unified image fusion approaches. First, the source images are diffused and then pre-fused. Next, a noise prediction network predicts the noise applied to the pre-fused image. During this stage, a Spatially-Adaptive Constraint layer constrains the fused image so that the features of each source image are preserved as fully as possible. Finally, in the reverse process, skip-sampling is used to overcome the long computation time of traditional diffusion probabilistic models while maintaining high-quality image generation. Compared to other diffusion model-based image fusion methods, our approach stands out for its structural simplicity and ease of training, and it serves as a unified image fusion method. Furthermore, compared to other unified image fusion methods, our network fully leverages the generalization ability of the diffusion probabilistic model, achieving adaptive feature extraction by constraining the reverse process with the source images. Qualitative and quantitative experimental results on four classic image fusion tasks demonstrate the superiority of our method over recent state-of-the-art unified image fusion methods.
computer science, artificial intelligence; engineering, electrical & electronic
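
The skip-sampling mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not specify the paper's exact schedule, so the code below assumes the common approach of visiting an evenly strided subset of the training timesteps during the reverse process. The function names `skip_schedule` and `step_pairs`, and the step counts in the usage note, are hypothetical and chosen for illustration only.

```python
from typing import Iterator, List, Tuple


def skip_schedule(num_train_steps: int, num_sample_steps: int) -> List[int]:
    """Return an evenly strided, descending subset of timesteps.

    Assumption: the reverse process visits only these timesteps
    instead of all `num_train_steps` of them.
    """
    if not 1 <= num_sample_steps <= num_train_steps:
        raise ValueError("need 1 <= num_sample_steps <= num_train_steps")
    # Integer arithmetic keeps the selected timesteps distinct.
    ascending = [
        (i * num_train_steps) // num_sample_steps
        for i in range(num_sample_steps)
    ]
    return ascending[::-1]


def step_pairs(schedule: List[int]) -> Iterator[Tuple[int, int]]:
    """Yield (t, t_prev) pairs; t_prev is -1 after the last kept step."""
    for idx, t in enumerate(schedule):
        t_prev = schedule[idx + 1] if idx + 1 < len(schedule) else -1
        yield t, t_prev
```

With these assumptions, `skip_schedule(1000, 50)` keeps 50 of 1000 timesteps, so a sampler that iterates over `step_pairs(...)` would call the noise prediction network 50 times rather than 1000. How the update from `t` to `t_prev` is computed depends on the sampler and is not shown here.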