Infrared-visible Image Fusion Based on Regional Attention Auto-Encoder

Peng Wang,Sheng Huang,Huimin Liu,Peng Tian
DOI: https://doi.org/10.1117/12.2690400
2023-01-01
Abstract:The images captured by a single sensor are often limited. How to use multi-sensor images has important value. For example, the imaging conditions of visible camera are relatively harsh, while the infrared camera can operate in all-day and all-weather and has longer visual distance. For better visual presentation and subsequent perception tasks, we focused on the infrared and visible image fusion based on auto-encoder. Specifically, we proposed a fusion strategy based on regional attention and a multi-scale convolution layer. The fusion strategy based on regional attention divides a image into several regions and adopts different fusion strategy for different regions. Multi-scale convolution layer is to capture the features of different receptive fields and improve the semantic representation ability of the encoder. From detailed experimental results, we can see that the optimized fusion algorithm is more robust, reduces the sensitivity to the classifier, and keeps more textures of background.
What problem does this paper attempt to address?