Abstract:Sketch recognition remains a significant challenge due to the limited training data and the substantial intra-class variance of freehand sketches for the same object. Conventional methods for this task often rely on the availability of the temporal order of sketch strokes, additional cues acquired from different modalities and supervised augmentation of sketch datasets with real images, which also limit the applicability and feasibility of these methods in real scenarios. In this paper, we propose a novel sketch-specific data augmentation (SSDA) method that leverages the quantity and quality of the sketches automatically. From the aspect of quantity, we introduce a Bezier pivot based deformation (BPD) strategy to enrich the training data. Towards quality improvement, we present a mean stroke reconstruction (MSR) approach to generate a set of novel types of sketches with smaller intra-class variances. Both of these solutions are unrestricted from any multi-source data and temporal cues of sketches. Furthermore, we show that some recent deep convolutional neural network models that are trained on generic classes of real images can be better choices than most of the elaborate architectures that are designed explicitly for sketch recognition. As SSDA can be integrated with any convolutional neural networks, it has a distinct advantage over the existing methods. Our extensive experimental evaluations demonstrate that the proposed method achieves the state-of-the-art results (84.27%) on the TU-Berlin dataset, outperforming the human performance by a remarkable 11.17% increase. Finally, more experiments show the practical value of our approach for the task of sketch-based image retrieval.

Sketch Recognition with Deep Visual-Sequential Fusion Model

Sketchformer++: A Hierarchical Transformer Architecture for Vector Sketch Representation

<i>Sketch-R2CNN</i>: An RNN-Rasterization-CNN Architecture for Vector Sketch Recognition

Sketch-R2CNN: an Attentive Network for Vector Sketch Recognition

Sequential Dual Deep Learning with Shape and Texture Features for Sketch Recognition

Sketch-Based 3D Model Retrieval via Multi-feature Fusion

SceneSketcher-v2: Fine-Grained Scene-Level Sketch-Based Image Retrieval Using Adaptive GCNs

End-to-End Photo-Sketch Generation via Fully Convolutional Representation Learning

Sketch-Specific Data Augmentation for Freehand Sketch Recognition

SFSegNet: Parse Freehand Sketches using Deep Fully Convolutional Networks

Entropy Information‐based Heterogeneous Deep Selective Fused Features Using Deep Convolutional Neural Network for Sketch Recognition

On Learning Semantic Representations for Large-Scale Abstract Sketches

Deep Stroke-Based Sketched Symbol Reconstruction and Segmentation

Enhancing Sketch-Based Image Retrieval Via Deep Discriminative Representation.

Sketch-R2CNN : An RNN-Rasterization-CNN Architecture for Vector Sketch Recognition

Domain Alignment Embedding Network for Sketch Face Recognition

A hierarchical residual network with compact triplet-center loss for sketch recognition

Sketch-pix2seq: a Model to Generate Sketches of Multiple Categories

Deep CNN-based Features for Hand-Drawn Sketch Recognition Via Transfer Learning Approach

Stroke-based semantic segmentation for scene-level free-hand sketches

Enhance Sketch Recognition's Explainability via Semantic Component-Level Parsing