SketchScene: Scene Sketch to Image Generation with Diffusion Models.

Zhenbei Wu,Haoge Deng,Qiang Wang,Di Kong,Jie Yang,Yonggang Qi
DOI: https://doi.org/10.1109/icme55011.2023.00357
2023-01-01
Abstract:Sketch is an abstract visual representation that can be recovered as natural photographs in the human mind. Many researchers are drawn to work on translating abstract sketches to natural photographs. Since conventional sketch-to-image models are designed to generate images with a single object as the subject, generating scene image with multiple classes of objects is a tricky problem. To tackle this challenge, we propose the first scene sketch-to-image generation method based on diffusion models. Our model uses an encoder to summarize the contour and class features of the scene sketch into a latent variable, and a decoder to reconstruct scene images from it. In scene sketch-to-image generation tasks, our method outperforms the state-of-the-art methods. Experiments also show that our model beats other methods in zero-shot general sketch-to-image generation. It demonstrates our model's potential for full-domain image generation.
What problem does this paper attempt to address?