SimpliFusion: a simplified infrared and visible image fusion network

Yong Liu,Xingyuan Li,Yong Liu,Wei Zhong
DOI: https://doi.org/10.1007/s00371-024-03423-1
IF: 2.835
2024-05-30
The Visual Computer
Abstract:This paper introduces SimpliFusion, a network designed for the fusion of infrared and visible images, leveraging a simplified transformer architecture. SimpliFusion is engineered to adeptly handle both long-range and short-range information, facilitating a more effective integration of infrared and visible data. The core of SimpliFusion lies in its innovative use of a streamlined transformer model, which simplifies the traditional complexities associated with transformer networks while maintaining high efficiency and accuracy in image fusion tasks. The network architecture of SimpliFusion incorporates specialized attention mechanisms that are adept at capturing and integrating diverse spatial and temporal features from both infrared and visible spectra. This includes an intra-domain fusion unit based on self-attention for processing within each spectral domain, and an inter-domain fusion unit based on cross-attention for bridging and integrating information across the infrared and visible domains. These units are specifically designed to exploit the long-range dependencies characteristic of infrared data and the detailed textural information prevalent in visible images. Extensive experiments conducted on a range of multi-modal image fusion scenarios, including both multi-modal image fusion and object detection, demonstrate the superiority of SimpliFusion.
computer science, software engineering
What problem does this paper attempt to address?