EMA‐GAN: A Generative Adversarial Network for Infrared and Visible Image Fusion with Multiscale Attention Network and Expectation Maximization Algorithm

Xiuliang Xi, Xin Jin, Qian Jiang, Yu Lin, Wei Zhou, Lei Guo
DOI: https://doi.org/10.1002/aisy.202300310
IF: 7.298
2023-08-30
Advanced Intelligent Systems
Abstract: An expectation‐maximization algorithm generative adversarial network (EMA‐GAN) is proposed to fuse images from different modalities. It is an EM learning framework built on a GAN that maximizes the likelihood of the fused results and estimates latent variables. Axial‐corner attention and multifrequency attention are used to highlight texture details in visible images and to extract pixel information from infrared images. The purpose of infrared and visible image fusion is to generate a fused image with rich information. Although most fusion methods achieve good performance, they still have shortcomings in extracting feature information from the source images, which makes it difficult to balance thermal radiation region information and texture detail information in the fused image. To address these issues, an expectation maximization (EM) learning framework based on generative adversarial networks (GAN) for infrared and visible image fusion is proposed. The EM algorithm (EMA) can obtain maximum likelihood estimates for problems with latent variables, which helps address the lack of labels in infrared and visible image fusion. The axial‐corner attention mechanism is designed to capture long‐range semantic information and texture information from the visible image. The multifrequency attention mechanism mines the relationships between features at different scales to highlight target information from infrared images in the fused result. Meanwhile, two discriminators are used to balance the two kinds of features, and a new loss function is designed to maximize the likelihood estimate of the data with soft class label assignments, which are obtained from the expectation network. Extensive experiments demonstrate the superiority of EMA‐GAN over state‐of‐the‐art methods.
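The EM idea that the abstract relies on (alternating between soft class assignments and likelihood maximization) can be illustrated with a minimal, generic sketch. This is not the EMA‐GAN architecture or its loss function: the two-component 1-D Gaussian mixture, the synthetic data, and the names `gaussian_pdf` and `em_two_gaussians` are all assumptions chosen for illustration only.

```python
import math
import random


def gaussian_pdf(x, mean, var):
    """Density of a 1-D Gaussian with the given mean and variance."""
    return math.exp(-(x - mean) ** 2 / (2.0 * var)) / math.sqrt(2.0 * math.pi * var)


def em_two_gaussians(data, n_iter=50):
    """EM for a two-component 1-D Gaussian mixture (illustrative toy, not EMA-GAN)."""
    # Crude initialisation (an assumption): put the two means at the data extremes.
    means = [min(data), max(data)]
    variances = [1.0, 1.0]
    weights = [0.5, 0.5]
    log_likelihoods = []

    for _ in range(n_iter):
        # E-step: compute soft class assignments (responsibilities) per sample.
        resp = []
        log_lik = 0.0
        for x in data:
            joint = [weights[k] * gaussian_pdf(x, means[k], variances[k]) for k in range(2)]
            total = sum(joint)
            log_lik += math.log(total)
            resp.append([j / total for j in joint])
        log_likelihoods.append(log_lik)

        # M-step: re-estimate parameters to maximise the expected log-likelihood.
        for k in range(2):
            nk = sum(r[k] for r in resp)
            weights[k] = nk / len(data)
            means[k] = sum(r[k] * x for r, x in zip(resp, data)) / nk
            var_k = sum(r[k] * (x - means[k]) ** 2 for r, x in zip(resp, data)) / nk
            variances[k] = max(var_k, 1e-6)  # floor to avoid a degenerate component

    return means, variances, weights, log_likelihoods


# Synthetic, unlabeled data drawn from two hidden classes.
random.seed(0)
samples = [random.gauss(-2.0, 0.5) for _ in range(200)] + [random.gauss(3.0, 0.8) for _ in range(200)]
means, variances, weights, log_likelihoods = em_two_gaussians(samples)
```

In this toy setting the class of each sample plays the role of the latent variable, and the log-likelihood recorded at each iteration should not decrease. In the paper, by the abstract's description, the soft assignments come from an expectation network rather than from a closed-form E-step.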