SPG-Net: Segmentation Prediction and Guidance Network for Image Inpainting

Yuhang Song,Chao Yang,Yeji Shen,Peng Wang,Qin Huang,C.-C. Jay Kuo
DOI: https://doi.org/10.48550/arXiv.1805.03356
2018-08-07
Abstract:In this paper, we focus on image inpainting task, aiming at recovering the missing area of an incomplete image given the context information. Recent development in deep generative models enables an efficient end-to-end framework for image synthesis and inpainting tasks, but existing methods based on generative models don't exploit the segmentation information to constrain the object shapes, which usually lead to blurry results on the boundary. To tackle this problem, we propose to introduce the semantic segmentation information, which disentangles the inter-class difference and intra-class variation for image inpainting. This leads to much clearer recovered boundary between semantically different regions and better texture within semantically consistent segments. Our model factorizes the image inpainting process into segmentation prediction (SP-Net) and segmentation guidance (SG-Net) as two steps, which predict the segmentation labels in the missing area first, and then generate segmentation guided inpainting results. Experiments on multiple public datasets show that our approach outperforms existing methods in optimizing the image inpainting quality, and the interactive segmentation guidance provides possibilities for multi-modal predictions of image inpainting.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to use semantic segmentation information to guide the restoration of missing regions in the image inpainting task, so as to generate clearer and more realistic boundaries and textures. Specifically, the existing methods based on generative models often lead to blurry results when dealing with object boundaries, because these methods do not fully utilize the segmentation information to constrain the object shapes. To solve this problem, the paper proposes a new framework, that is, by introducing semantic segmentation information, decoupling the differences between different categories and the changes within the same category, thereby achieving clearer restored boundaries and better texture consistency. The paper proposes a two - step model, called SPG - Net (Segmentation Prediction and Guidance Network), for image inpainting: 1. **Segmentation Prediction Network (SP - Net)**: First, predict the segmentation labels of the missing regions, providing prior knowledge of the object positions and shapes. 2. **Segmentation Guidance Network (SG - Net)**: Combine the complete segmentation mask and the input image to generate the final inpainting result. Through this method, the paper aims to improve the quality of image inpainting, especially at the boundaries between different objects, and at the same time provides the possibility of multi - modal prediction, allowing users to generate different inpainting results by editing the segmentation mask.