Image inpainting method based on AU-GAN

Chuangchuang Dong,Huaming Liu,Xiuyou Wang,Xuehui Bi
DOI: https://doi.org/10.1007/s00530-024-01290-3
IF: 3.9
2024-03-31
Multimedia Systems
Abstract:Image inpainting refers to the process of filling in missing regions or removing objects, and has broad application prospects. The rapid development of deep learning has led to new technological breakthroughs in image repair technology, continuously improving the quality of image inpainting. However, when we inpaint large missing regions, the texture and structural features of the image cannot be comprehensively utilized. This leads to blurry images. To solve this problem, we propose an improved dual-stream U-Net algorithm that adds an attention mechanism to the two U-Net networks known as a dual AU-Net network to improve the texture details of the image. In addition, the location code (LC) of damaged regions is added to the network to guide network repair and accelerate the network convergence speed. Least squares GAN (LSGAN) loss is added to the generator's adversarial network to capture more content details and enhance training stability. The PSNR is 33.93 and the SSIM is 0.931 in the CelebA and Paris datasets. This method has been proven effective when compared to other methods.
computer science, information systems, theory & methods
What problem does this paper attempt to address?
The paper attempts to address the problem in image inpainting where existing methods fail to fully utilize the texture and structural features of the image when repairing large missing areas, resulting in blurry repaired images. Specifically, existing image inpainting methods often encounter the following issues when dealing with large missing areas: 1. **Incomplete texture and structural features**: Existing repair methods cannot fully extract and utilize the texture and structural information of the image, leading to poor quality of the repaired image. 2. **Low training efficiency**: Some methods have a slow convergence speed during training, requiring a long time to achieve good repair results. 3. **Poor visual consistency**: The repaired image may lack coherence and consistency visually, especially when dealing with complex scenes. To address these issues, the paper proposes an improved dual-stream U-Net algorithm and incorporates attention mechanism, location code, and Least Squares GAN (LSGAN) loss function in the generator. These improvements aim to enhance the texture details and overall quality of the repaired image, accelerate the convergence speed of the network, and ensure visual consistency and coherence in the repair results.