GAN-GA: infrared and visible image fusion generative adversarial network based on global awareness

DOI: https://doi.org/10.1007/s10489-024-05561-4
IF: 5.3
2024-06-03
Applied Intelligence
Abstract: Current generative adversarial network (GAN)-based methods for infrared and visible image fusion often overlook global information, which leads to a poorly balanced distribution of infrared intensity in the fused image and an incongruous overall appearance. To address this issue, this paper proposes a new GAN-based global awareness fusion network, termed GAN-GA. The proposed method comprises a generator and two discriminators. The generator consists of three series-connected global awareness blocks (GABlocks), each comprising a detail branch, a content branch, and a feature aggregation block. The detail branch employs a convolutional network and max pooling to extract local detail, while the content branch uses a Transformer to obtain global information. The local details and global information are then passed through the feature aggregation block, which attends to distinct spatial locations and assigns weights according to information importance, yielding the final fused image. Moreover, primary and auxiliary concepts are introduced into the content loss: the difference images of the infrared and visible inputs are incorporated into the loss function as supplementary information, so that the information in the source images is fully exploited. For the discriminators, the WGAN-LP loss is used to constrain training; it introduces a new gradient penalty, building on WGAN-GP, to strengthen their discriminative capability. With these enhancements, GAN-GA effectively captures the global features of the source images while preserving local texture and infrared intensity, producing a more coordinated fused image overall.
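The abstract names the WGAN-LP gradient penalty without stating its formula. As a rough illustration only, the sketch below contrasts the two-sided penalty usually associated with WGAN-GP with the one-sided penalty usually associated with WGAN-LP in the wider literature; the exact form used in GAN-GA may differ. The function names, the weight `lam=10.0`, and the sample gradients are assumptions made for this example. In a real training loop the gradients would come from automatic differentiation of the discriminator at points interpolated between real and fused images.

```python
import math

def gradient_norm(grad):
    """Euclidean norm of one flattened gradient vector."""
    return math.sqrt(sum(g * g for g in grad))

def wgan_gp_penalty(grads, lam=10.0):
    """Two-sided penalty: lam * mean((||g|| - 1)^2).

    Penalizes gradient norms both above and below 1.
    """
    terms = [(gradient_norm(g) - 1.0) ** 2 for g in grads]
    return lam * sum(terms) / len(terms)

def wgan_lp_penalty(grads, lam=10.0):
    """One-sided penalty: lam * mean(max(0, ||g|| - 1)^2).

    Penalizes only gradient norms that exceed 1.
    """
    terms = [max(0.0, gradient_norm(g) - 1.0) ** 2 for g in grads]
    return lam * sum(terms) / len(terms)

# Hypothetical per-sample discriminator gradients (assumed values):
# the first has norm 0.5, the second has norm 5.0.
sample_grads = [[0.0, 0.5], [3.0, 4.0]]

gp = wgan_gp_penalty(sample_grads)
lp = wgan_lp_penalty(sample_grads)
print(f"WGAN-GP style penalty: {gp:.2f}")
print(f"WGAN-LP style penalty: {lp:.2f}")
```

Under this formulation, the sample with norm 0.5 contributes to the two-sided penalty but not to the one-sided one, so the one-sided version constrains the discriminator less strictly where its gradients are already small.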
computer science, artificial intelligence