Comic Text Detection and Recognition Based on Deep Learning

Hong Xin,Chi Ma,De Li
DOI: https://doi.org/10.1109/icaml54311.2021.00012
2021-07-01
Abstract:In recent years, there are a variety of methods for detecting comic characters or avatars for research in the field of comic digitization, leading to efficient retrieval and analysis of comics. However, relatively little research has been conducted on the experiments and applications of comic text detection. Therefore, this paper adopts a deep learning-based target detection method for detecting text in comic-YOLOv3. The comic dataset chosen for this paper is the Japanese manga l09 dataset, and a new comic dataset-MH60 is proposed, in which all the comic are Chinese comics. The experimental results show that the AP of YOLOv3 model is about 0.89 on the manga l09 dataset and about 0.87 on the MH60 dataset. for the detected text boxes, this paper recognizes the comic text by two text recognition tools and performs a simple analysis, the results show that the text recognition accuracy of Baidu API is high and can be used for comic text recognition.
What problem does this paper attempt to address?