SketchDeco: Decorating B&W Sketches with Colour

Chaitat Utintu, Pinaki Nath Chowdhury, Aneeshan Sain, Subhadeep Koley, Ayan Kumar Bhunia, Yi-Zhe Song
2024-05-29
Abstract: This paper introduces a novel approach to sketch colourisation, inspired by the universal childhood activity of colouring and its professional applications in design and story-boarding. Striking a balance between precision and convenience, our method utilises region masks and colour palettes to allow intuitive user control, steering clear of the meticulousness of manual colour assignments or the limitations of textual prompts. By strategically combining ControlNet and staged generation, incorporating Stable Diffusion v1.5, and leveraging BLIP-2 text prompts, our methodology facilitates faithful image generation and user-directed colourisation. Addressing challenges of local and global consistency, we employ inventive solutions such as an inversion scheme, guided sampling, and a self-attention mechanism with a scaling factor. The resulting tool is not only fast and training-free but also compatible with consumer-grade Nvidia RTX 4090 Super GPUs, making it a valuable asset for both creative professionals and enthusiasts in various fields. Project Page: https://chaitron.github.io/SketchDeco/
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper introduces SketchDeco, a sketch colourisation method inspired by children's colouring activities. It aims to balance precision and convenience: users steer the result with region masks and colour palettes, which avoids both the tedium of manually assigning a colour to every region and the imprecision of textual prompts. By combining ControlNet with staged generation, built on Stable Diffusion v1.5 and BLIP-2 text prompts, SketchDeco supports faithful image generation and user-directed colourisation.

The central challenge is how to specify which colours to apply and where. Early methods assigned colours to individual regions by hand, while more recent approaches rely on textual descriptions, which can be too imprecise. SketchDeco instead uses region masks to specify "where" to colour and a colour palette to represent the desired colour scheme. The model's task is then to colour the designated regions with the selected palette while keeping the overall image harmonious. To maintain local and global consistency, the paper proposes an inversion scheme, guided sampling, and a self-attention mechanism with a scaling factor. The approach requires no training and runs on consumer-grade Nvidia RTX 4090 Super GPUs, making it a useful tool for creative professionals and enthusiasts.
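To make the two user inputs concrete, here is a minimal Python sketch of a region mask (the "where") and a colour palette (the desired colour scheme). It is not the paper's diffusion-based method: the function names, the tiny 2x2 image, and the nearest-colour rule are all assumptions chosen only to illustrate how a mask restricts where palette colours may be applied.

```python
from typing import List, Tuple

Colour = Tuple[int, int, int]   # RGB, 0-255
Image = List[List[Colour]]      # rows of pixels
Mask = List[List[bool]]         # True where the user wants colour applied


def nearest_palette_colour(pixel: Colour, palette: List[Colour]) -> Colour:
    """Return the palette entry with the smallest squared RGB distance to pixel."""
    return min(palette, key=lambda c: sum((a - b) ** 2 for a, b in zip(pixel, c)))


def apply_palette_in_region(image: Image, mask: Mask, palette: List[Colour]) -> Image:
    """Snap pixels inside the mask to the palette; leave all other pixels untouched.

    Hypothetical stand-in for the colourisation step: SketchDeco uses a
    generative model here, not a nearest-colour lookup.
    """
    return [
        [nearest_palette_colour(px, palette) if inside else px
         for px, inside in zip(img_row, mask_row)]
        for img_row, mask_row in zip(image, mask)
    ]


# Toy 2x2 image, a mask covering three of the four pixels, and a two-colour palette.
image = [[(250, 10, 10), (10, 10, 240)],
         [(120, 130, 125), (0, 0, 40)]]
mask = [[True, True],
        [False, True]]
palette = [(255, 0, 0), (0, 0, 255)]

result = apply_palette_in_region(image, mask, palette)
```

In this toy example, the pixel outside the mask keeps its original value, while every masked pixel takes one of the two palette colours. A real system would also have to keep the coloured region consistent with the rest of the image, which is the problem the inversion scheme, guided sampling, and scaled self-attention are described as addressing.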