TCGAN: Three-Channel Generate Adversarial Network

Wanliang Wang,Hangyao Tu,Jiacheng Chen,Fei Wu
DOI: https://doi.org/10.1007/s11042-023-15672-8
IF: 2.577
2024-01-01
Multimedia Tools and Applications
Abstract:Recently Image-to-image translation has achieve much progress in the literature. However, in exist method, border distortion, color distortion and others are the serious issues which continue to be resolved. Existing methods do not produce satisfactory results because the most exist methods are mainly used multi-channels which increase the difficulty of finding the gradient in optimizer. To address this problem, we proposed the Three-Channel Generative Adversarial Network. The algorithm decomposed color image into three color channels of RGB and utilized the single channel generators and dual discriminators in each color channel for adversarial training. The detailed discriminator adopted reversed PatchGAN which we proposed to be responsible for the image texture discrimination, and the structure discriminator adopted seven-layer convolutional structure to be responsible for the image structure discrimination. Then to improve the accuracy of translation, the loss function that associated to the network model has also been revised. In experiments, the ablation study were provided to prove the effectiveness of the algorithm on Cityscapes and Facades datasets. Our extensive experiments on a variety of datasets, including Style transfer, Image labeling transfer and End-to-end image dehazing, which consistently demonstrate clear improvement over the pix2pix method both qualitatively and quantitatively.
What problem does this paper attempt to address?