Synthetic images generation for text detection and recognition in the wild

Natalia Khanzhina,Natalia Slepkova,Andrey Filchenkov
DOI: https://doi.org/10.1117/12.2557064
2020-01-31
Abstract:Deep neural networks help solving different images related tasks very efficiently, though their cost is high whereas a lot of data are required for training. While there is a great demand to build neural network models for optical character detection and recognition for different languages, such as, for mobile real-time applications, datasets collecting and labeling are quite expensive. In this paper, we propose the fully automated approach for synthetic images with text generation based on deep learning and projective geometry methods. For evaluation, we trained two neural networks on the dataset generated by our algorithm. Our approach enables to decrease the false negative rate on real images from SVT and SVT-50 datasets in comparison with training on SynthText dataset, giving ~1% of F1-measure increasing.
What problem does this paper attempt to address?