Abstract: Scene sketching converts a scene into a simplified, abstract representation that captures the essential elements and composition of the original scene. It requires semantic understanding of the scene and consideration of the different regions within it. Because scenes often contain diverse visual information across regions, such as foreground objects, background elements, and spatial divisions, handling these regions poses unique difficulties. In this paper, we define a sketch as sets of Bézier curves. We optimize the different regions of the input scene over multiple rounds. In each round, strokes sampled from the next region are seamlessly integrated into the sketch generated in the previous round. We propose an additional stroke initialization method to ensure the integrity of the scene and the convergence of the optimization. A novel CLIP-based semantic loss and a VGG-based feature loss guide our multi-round optimization. Extensive qualitative and quantitative experiments on the generated sketches confirm the effectiveness of our method.
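To make the "sketch as sets of Bezier curves" representation concrete, here is a minimal sketch of how one stroke could be stored and evaluated. The abstract does not state the curve degree, so the cubic form (four control points), the function name `cubic_bezier`, and the per-region dictionary layout are all assumptions for illustration, not the authors' implementation.

```python
def cubic_bezier(control_points, num_samples=50):
    """Sample points along one cubic Bezier stroke.

    control_points: four (x, y) pairs. Cubic degree is an assumption;
    the source only says that a sketch is made of Bezier curves.
    """
    if len(control_points) != 4:
        raise ValueError("a cubic Bezier curve needs exactly four control points")
    if num_samples < 2:
        raise ValueError("num_samples must be at least 2")
    (x0, y0), (x1, y1), (x2, y2), (x3, y3) = control_points
    points = []
    for i in range(num_samples):
        t = i / (num_samples - 1)  # curve parameter in [0, 1]
        u = 1.0 - t
        # Bernstein basis polynomials of degree 3
        b0, b1, b2, b3 = u ** 3, 3 * u * u * t, 3 * u * t * t, t ** 3
        points.append((b0 * x0 + b1 * x1 + b2 * x2 + b3 * x3,
                       b0 * y0 + b1 * y1 + b2 * y2 + b3 * y3))
    return points


# Hypothetical layout: one set of strokes per scene region.
sketch = {
    "foreground": [[(0.1, 0.1), (0.2, 0.4), (0.4, 0.4), (0.5, 0.1)]],
    "background": [[(0.0, 0.9), (0.3, 0.8), (0.6, 1.0), (1.0, 0.9)]],
}
polylines = {
    region: [cubic_bezier(stroke) for stroke in strokes]
    for region, strokes in sketch.items()
}
```

In an optimization-based pipeline the control points would be the free parameters that the losses adjust; the sampling above only shows how a stroke becomes drawable geometry.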

What problem does this paper attempt to address?

The main problems that this paper attempts to solve are two key challenges in scene sketch generation:

1. **Semantic understanding of complex scenes and processing of different regions**: Scene sketch generation requires semantic understanding of the scene and must take into account the different regions within it. Since a scene contains diverse visual information (such as foreground objects, background elements, and spatial divisions), processing these regions presents unique difficulties. Specifically, foreground objects or focal points may require more detailed depiction, while background elements can be represented more loosely.
2. **Balancing the visual effect and aesthetics of the sketch**: Existing methods usually rely on explicit sketch datasets for training, and the sketches they generate are often simplified, abstract renderings in a fixed or preset style. This makes it difficult to balance the visual effect of the sketch against its attractiveness and aesthetics.

To solve these problems, the authors propose a region-based multi-round optimization method (MROSS: Multi-Round Region-Based Optimization for Scene Sketching). This method improves scene sketch generation in the following ways:

- **Region-by-region optimization**: Optimize different regions of the input scene separately so that the characteristics of each region are accurately captured.
- **Edge-guided stroke initialization**: Use the farthest point sampling (FPS) algorithm to sample stroke positions uniformly and emphasize content information.
- **Novel loss function**: Introduce a CLIP-based semantic loss and a VGG-based feature loss to guide the multi-round optimization process, so that the generated sketches are semantically accurate and retain geometric details.

Through these improvements, the method can generate high-quality scene sketches at different levels of abstraction and can flexibly adjust the level of detail in different regions.
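The farthest point sampling step mentioned above can be illustrated with a short greedy implementation. This is a generic FPS sketch, not the paper's code: the function name `farthest_point_sampling`, the `start_index` parameter, and the idea that the candidates are (x, y) edge-pixel coordinates are assumptions made for the example.

```python
import math


def farthest_point_sampling(points, k, start_index=0):
    """Greedy FPS: return indices of k points spread evenly over `points`.

    points: list of (x, y) candidates, assumed here to be edge-pixel
    coordinates from an edge map (hypothetical input).
    """
    if not 0 < k <= len(points):
        raise ValueError("k must be between 1 and len(points)")
    selected = [start_index]
    # Distance from every candidate to its nearest already-selected point.
    min_dist = [math.dist(p, points[start_index]) for p in points]
    while len(selected) < k:
        # Pick the candidate farthest from everything selected so far.
        next_index = max(range(len(points)), key=min_dist.__getitem__)
        selected.append(next_index)
        for i, p in enumerate(points):
            min_dist[i] = min(min_dist[i], math.dist(p, points[next_index]))
    return selected


# Hypothetical edge-pixel candidates clustered near the origin plus one outlier.
edge_points = [(0, 0), (1, 0), (2, 0), (10, 0)]
stroke_seeds = [edge_points[i] for i in farthest_point_sampling(edge_points, k=3)]
```

Because each new point maximizes its distance to the points already chosen, dense clusters of candidates do not receive a disproportionate share of strokes, which is the uniformity property the summary attributes to FPS.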

Multi-Round Region-Based Optimization for Scene Sketching

SceneSketcher: Fine-Grained Image Retrieval with Scene Sketches

SceneSketcher-v2: Fine-Grained Scene-Level Sketch-Based Image Retrieval Using Adaptive GCNs

Sketchformer++: A Hierarchical Transformer Architecture for Vector Sketch Representation

Region Assisted Sketch Colorization

Sketch2Scene: sketch-based co-retrieval and co-placement of 3D models

SketchScene: Scene Sketch to Image Generation with Diffusion Models

Reconstructing 3D Shapes from Multiple Sketches using Direct Shape Optimization

A Global Energy Optimization Framework for 2.1D Sketch Extraction from Monocular Images

Stroke-based semantic segmentation for scene-level free-hand sketches

CLIPascene: Scene Sketching with Different Types and Levels of Abstraction

Sketch-Guided Scene Image Generation

SketchDesc: Learning Local Sketch Descriptors for Multi-View Correspondence

CustomSketching: Sketch Concept Extraction for Sketch-based Image Synthesis and Editing

Sketch-2-4d: Sketch Driven Dynamic 3d Scene Generation

Unsupervised Scene Sketch to Photo Synthesis

Sketch Simplification Guided by Complex Agglomeration

Deep Plastic Surgery: Robust and Controllable Image Editing with Human-Drawn Sketches

SketchyScene: Richly-Annotated Scene Sketches

Sketch3D: Style-Consistent Guidance for Sketch-to-3D Generation

Sketch-pix2seq: a Model to Generate Sketches of Multiple Categories