FAColorGAN: a Dual-Branch Generative Adversarial Network for Near-Infrared Image Colorization

Jin Duan, Meiling Gao, Guangyu Zhao, Weiqiang Zhao, Suxin Mo, Wenxue Zhang
DOI: https://doi.org/10.1007/s11760-024-03266-2
2024-01-01
Abstract: To address detail loss and poor robustness in near-infrared (NIR) image colorization, this paper introduces a dual-branch colorization approach. The proposed method is based on a generative adversarial network and comprises a spatial-frequency skip (FS) branch and an attention prior (AP) branch. First, the two-dimensional discrete wavelet transform is introduced in the FS branch to preserve the higher-frequency information in NIR images. Second, to prevent the model from overfitting, a pre-trained VGG19 model is introduced in the AP branch along with contrast and noise attention modules. This enhances image contrast while suppressing noise, thereby improving the generalization capability of the network. Finally, the dual-branch structure performs colorization and detail reconstruction on the images, yielding a superior coloring effect for NIR images. Additionally, a novel joint loss function guides network training from multiple perspectives, improving the overall performance of the network. Comparative experiments with state-of-the-art methods demonstrate that the proposed approach effectively overcomes detail loss and poor robustness in NIR image colorization. The results exhibit enhanced clarity and naturalness, aligning well with human visual perception. The proposed method achieves an improvement of more than 1.0868 dB in peak signal-to-noise ratio (PSNR) and 0.0177 in structural similarity (SSIM) over the best models available in the literature.
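The abstract names two building blocks that are easy to illustrate in isolation: the two-dimensional discrete wavelet transform used to keep high-frequency detail, and the peak signal-to-noise ratio used for evaluation. The following is a minimal NumPy sketch of both, not the authors' implementation. The Haar wavelet, the function names (haar_dwt2, haar_idwt2, psnr), and the 8-bit peak value of 255 are assumptions made for illustration; the abstract does not state which wavelet or data range the paper uses.

```python
import numpy as np


def haar_dwt2(img):
    """Single-level 2D Haar DWT (assumed wavelet; height and width must be even).

    Returns one low-frequency subband (ll) and three detail subbands.
    Subband naming conventions (LH vs. HL) differ between libraries.
    """
    img = np.asarray(img, dtype=np.float64)
    a = img[0::2, 0::2]  # top-left pixel of each 2x2 block
    b = img[0::2, 1::2]  # top-right
    c = img[1::2, 0::2]  # bottom-left
    d = img[1::2, 1::2]  # bottom-right
    ll = (a + b + c + d) / 2.0  # local average: coarse structure
    lh = (a - b + c - d) / 2.0  # differences between adjacent columns
    hl = (a + b - c - d) / 2.0  # differences between adjacent rows
    hh = (a - b - c + d) / 2.0  # diagonal differences
    return ll, lh, hl, hh


def haar_idwt2(ll, lh, hl, hh):
    """Inverse of haar_dwt2; reconstructs the original image exactly."""
    h, w = ll.shape
    out = np.empty((2 * h, 2 * w), dtype=np.float64)
    out[0::2, 0::2] = (ll + lh + hl + hh) / 2.0
    out[0::2, 1::2] = (ll - lh + hl - hh) / 2.0
    out[1::2, 0::2] = (ll + lh - hl - hh) / 2.0
    out[1::2, 1::2] = (ll - lh - hl + hh) / 2.0
    return out


def psnr(reference, test, peak=255.0):
    """Peak signal-to-noise ratio in dB; peak=255 assumes 8-bit images."""
    diff = np.asarray(reference, dtype=np.float64) - np.asarray(test, dtype=np.float64)
    mse = np.mean(diff ** 2)
    if mse == 0:
        return float("inf")
    return 10.0 * np.log10(peak ** 2 / mse)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    nir = rng.integers(0, 256, size=(8, 8)).astype(np.float64)  # stand-in NIR patch
    ll, lh, hl, hh = haar_dwt2(nir)
    restored = haar_idwt2(ll, lh, hl, hh)
    print("subband shape:", ll.shape)
    print("max reconstruction error:", np.abs(restored - nir).max())
    print("PSNR vs. slightly brighter copy:", psnr(nir, nir + 1.0))
```

Because the transform is invertible, edges and texture moved into the three detail subbands are not lost to downsampling and can be reinjected later in a network, which is the general motivation for wavelet-based skip connections. How the FS branch actually routes these subbands is not described in the abstract.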