SPG-Net: Segmentation Prediction and Guidance Network for Image Inpainting

Yuhang Song,Chao Yang,Yeji Shen,Peng Wang,Qin Huang,C.-C. Jay Kuo

DOI: https://doi.org/10.48550/arXiv.1805.03356

2018-08-07

Abstract:In this paper, we focus on image inpainting task, aiming at recovering the missing area of an incomplete image given the context information. Recent development in deep generative models enables an efficient end-to-end framework for image synthesis and inpainting tasks, but existing methods based on generative models don't exploit the segmentation information to constrain the object shapes, which usually lead to blurry results on the boundary. To tackle this problem, we propose to introduce the semantic segmentation information, which disentangles the inter-class difference and intra-class variation for image inpainting. This leads to much clearer recovered boundary between semantically different regions and better texture within semantically consistent segments. Our model factorizes the image inpainting process into segmentation prediction (SP-Net) and segmentation guidance (SG-Net) as two steps, which predict the segmentation labels in the missing area first, and then generate segmentation guided inpainting results. Experiments on multiple public datasets show that our approach outperforms existing methods in optimizing the image inpainting quality, and the interactive segmentation guidance provides possibilities for multi-modal predictions of image inpainting.

Computer Vision and Pattern Recognition

What problem does this paper attempt to address?

The problem that this paper attempts to solve is how to use semantic segmentation information to guide the restoration of missing regions in the image inpainting task, so as to generate clearer and more realistic boundaries and textures. Specifically, the existing methods based on generative models often lead to blurry results when dealing with object boundaries, because these methods do not fully utilize the segmentation information to constrain the object shapes. To solve this problem, the paper proposes a new framework, that is, by introducing semantic segmentation information, decoupling the differences between different categories and the changes within the same category, thereby achieving clearer restored boundaries and better texture consistency. The paper proposes a two - step model, called SPG - Net (Segmentation Prediction and Guidance Network), for image inpainting: 1. **Segmentation Prediction Network (SP - Net)**: First, predict the segmentation labels of the missing regions, providing prior knowledge of the object positions and shapes. 2. **Segmentation Guidance Network (SG - Net)**: Combine the complete segmentation mask and the input image to generate the final inpainting result. Through this method, the paper aims to improve the quality of image inpainting, especially at the boundaries between different objects, and at the same time provides the possibility of multi - modal prediction, allowing users to generate different inpainting results by editing the segmentation mask.

SPG-Net: Segmentation Prediction and Guidance Network for Image Inpainting

Image Inpainting Based on Interactive Separation Network and Progressive Reconstruction Algorithm

Interactive Separation Network for Image Inpainting

Semantic Residual Pyramid Network for Image Inpainting

MMGInpainting: Multi-Modality Guided Image Inpainting Based On Diffusion Models

Boosted GAN with Semantically Interpretable Information for Image Inpainting

Fully Context-Aware Image Inpainting with a Learned Semantic Pyramid

SEM-Net: Efficient Pixel Modelling for image inpainting with Spatially Enhanced SSM

Face Image Inpainting Based on Generative Adversarial Network

An Adaptive Post-Processing Network with the Global-Local Aggregation for Semantic Segmentation

PSSD-Transformer: Powerful Sparse Spike-Driven Transformer for Image Semantic Segmentation

Unbiased Multi-Modality Guidance for Image Inpainting

Semantic Image Inpainting with Multi-Stage Feature Reasoning Generative Adversarial Network

ESGAN: Edge Loss and Spatial Convolution Generative Adversarial Network for Image Inpainting

JPGNet: Joint Predictive Filtering and Generative Network for Image Inpainting

A Progressive and Multi-Prior-Guided Network for Image Inpainting

SemID: Blind Image Inpainting with Semantic Inconsistency Detection

Inpainting with Sketch Reconstruction and Comprehensive Feature Selection

DeepGIN: Deep Generative Inpainting Network for Extreme Image Inpainting

Generative Image Inpainting with Residual Attention Learning

An Adaptive Iterative Inpainting Method with More Information Exploration