Unconstrained Text Detection in Manga: a New Dataset and Baseline

Julián Del Gobbo,Rosana Matuk Herrera
DOI: https://doi.org/10.48550/arXiv.2009.04042
2020-09-09
Computer Vision and Pattern Recognition
Abstract:The detection and recognition of unconstrained text is an open problem in research. Text in comic books has unusual styles that raise many challenges for text detection. This work aims to binarize text in a comic genre with highly sophisticated text styles: Japanese manga. To overcome the lack of a manga dataset with text annotations at a pixel level, we create our own. To improve the evaluation and search of an optimal model, in addition to standard metrics in binarization, we implement other special metrics. Using these resources, we designed and evaluated a deep network model, outperforming current methods for text binarization in manga in most metrics.
What problem does this paper attempt to address?