Human-Inspired Facial Sketch Synthesis with Dynamic Adaptation

Fei Gao, Yifan Zhu, Chang Jiang, Nannan Wang
2023-09-01
Abstract: Facial sketch synthesis (FSS) aims to generate a vivid sketch portrait from a given facial photo. Existing FSS methods merely rely on 2D representations of facial semantics or appearance. However, professional human artists usually use outlines or shadings to convey 3D geometry. Thus facial 3D geometry (e.g., a depth map) is extremely important for FSS. Besides, different artists may use diverse drawing techniques and create multiple styles of sketches, but the style is globally consistent within a sketch. Inspired by such observations, in this paper, we propose a novel Human-Inspired Dynamic Adaptation (HIDA) method. Specifically, we propose to dynamically modulate neuron activations based on a joint consideration of both facial 3D geometry and 2D appearance, as well as globally consistent style control. Besides, we use deformable convolutions at coarse scales to align deep features, for generating abstract and distinct outlines. Experiments show that HIDA can generate high-quality sketches in multiple styles, and significantly outperforms previous methods, over a large range of challenging faces. Besides, HIDA allows precise style control of the synthesized sketch, and generalizes well to natural scenes and other artistic styles. Our code and results have been released online at: https://github.com/AiArt-HDU/HIDA
Computer Vision and Pattern Recognition, Multimedia
What problem does this paper attempt to address?
This paper addresses a limitation of existing facial sketch synthesis (FSS) methods: they rely only on 2D representations of the facial photo, such as semantic or appearance features, and ignore the 3D geometric information that artists convey through outlines and shading and that is crucial for generating high-quality sketches. In addition, different artists use different drawing techniques and so produce diverse sketch styles, yet the style stays consistent within a single sketch. Existing FSS methods struggle with large variations in appearance, such as changes in pose, lighting, expression, and skin color, and they lack globally consistent control of style. To address these issues, the paper proposes the Human-Inspired Dynamic Adaptation (HIDA) method. HIDA dynamically modulates neuron activations based on facial 3D geometry (such as depth maps) and 2D appearance jointly, together with a globally consistent style control, and thereby generates high-quality sketches in multiple styles. In addition, HIDA uses deformable convolutions at coarse scales to align deep features, which helps produce abstract and distinct outlines. Experiments show that HIDA generates high-quality sketches in multiple styles and significantly outperforms previous methods over a large range of challenging faces. Moreover, HIDA allows precise style control and generalizes well to natural scenes and other artistic styles.
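
The idea of modulating activations from geometry, appearance, and style can be illustrated with a toy example. The text above does not specify the exact form of the modulation, so this minimal sketch assumes a scale-and-shift form applied to a normalized feature map: the scale and shift at each pixel depend on the local depth and appearance values, plus a term derived from a style code shared by all pixels. The function names, the `weights` dictionary, and the fixed linear weights (standing in for learned layers) are all hypothetical and are not taken from the released HIDA code.

```python
import math


def instance_normalize(feature):
    """Normalize a 2D feature map (list of rows) to zero mean and unit variance."""
    values = [v for row in feature for v in row]
    mean = sum(values) / len(values)
    var = sum((v - mean) ** 2 for v in values) / len(values)
    std = math.sqrt(var + 1e-5)  # small epsilon avoids division by zero
    return [[(v - mean) / std for v in row] for row in feature]


def modulate(feature, depth, appearance, style, weights):
    """Scale and shift each normalized activation (assumed modulation form).

    The depth and appearance terms vary per pixel; the style terms are
    computed once and reused everywhere, so the style contribution is
    identical at every pixel.
    """
    normalized = instance_normalize(feature)
    # Global style contribution: one scalar each for scale and shift.
    style_gamma = sum(w * s for w, s in zip(weights["style_gamma"], style))
    style_beta = sum(w * s for w, s in zip(weights["style_beta"], style))

    out = []
    for i, row in enumerate(normalized):
        out_row = []
        for j, x in enumerate(row):
            gamma = (1.0
                     + weights["depth_gamma"] * depth[i][j]
                     + weights["app_gamma"] * appearance[i][j]
                     + style_gamma)
            beta = (weights["depth_beta"] * depth[i][j]
                    + weights["app_beta"] * appearance[i][j]
                    + style_beta)
            out_row.append(gamma * x + beta)
        out.append(out_row)
    return out


# Hypothetical toy inputs: a 2x2 feature map with matching depth and appearance maps.
feature = [[1.0, 2.0], [3.0, 4.0]]
depth = [[0.1, 0.2], [0.3, 0.4]]
appearance = [[0.5, 0.5], [0.9, 0.9]]
style = [1.0, 0.0]  # e.g. a one-hot code selecting the first of two styles
weights = {
    "depth_gamma": 0.5, "app_gamma": 0.2, "style_gamma": [0.1, -0.1],
    "depth_beta": 0.3, "app_beta": 0.1, "style_beta": [0.05, -0.05],
}
modulated = modulate(feature, depth, appearance, style, weights)
```

Changing only `style` shifts the scale and shift by the same amount at every pixel, which is one simple way to picture a globally consistent style, while `depth` and `appearance` change the response locally.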