An Unsupervised Method for Chinese Speech Text Localization in Comic Images

Dong LIU, Luyuan LI, Yongtao WANG, Zhi TANG
DOI: https://doi.org/10.13209/j.0479-8023.2014.008
2014-01-01
Abstract: To meet the growing demand for reading Chinese comic images on mobile devices, the authors propose an unsupervised method for localizing Chinese speech text, in contrast to the existing learning-based methods. The method consists of three major stages: 1) detect the white regions that surround the text characters (speech balloons, hereinafter) by exploiting the connectivity of the white region within each balloon, and localize the characters inside each speech balloon; 2) cluster the detected characters into character strings (rows or columns of characters aligned horizontally or vertically) based on character shape and typesetting consistency, and extract their font features; 3) based on the extracted font features, detect the rest of the character strings with a Bayesian classifier. The proposed method is tested on a dataset consisting of 900 comic images and achieves satisfactory results. © 2014 Peking University.
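To make the second stage more concrete, the following is a minimal sketch of how detected character boxes might be clustered into strings by size and alignment. It is not the authors' implementation: the function name `group_into_strings`, the `(x, y, width, height)` box format, the union-find grouping, and the thresholds `size_tol`, `align_tol`, and `max_gap` are all assumptions chosen for illustration, and the reading direction is passed in rather than inferred.

```python
from typing import Dict, List, Tuple

# Hypothetical box format: (x, y, width, height) with a top-left origin.
Box = Tuple[float, float, float, float]


def group_into_strings(
    boxes: List[Box],
    vertical: bool = True,
    size_tol: float = 0.3,   # assumed tolerance on width/height difference
    align_tol: float = 0.3,  # assumed tolerance on center offset across the string
    max_gap: float = 1.0,    # assumed maximum gap, in units of character size
) -> List[List[int]]:
    """Group character boxes into columns (vertical) or rows (horizontal)."""
    parent = list(range(len(boxes)))

    def find(i: int) -> int:
        # Union-find lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    def linked(a: Box, b: Box) -> bool:
        scale = max(a[2], a[3], b[2], b[3])
        # Shape consistency: characters in one string have similar sizes.
        if abs(a[2] - b[2]) > size_tol * scale or abs(a[3] - b[3]) > size_tol * scale:
            return False
        dx = abs((a[0] + a[2] / 2) - (b[0] + b[2] / 2))
        dy = abs((a[1] + a[3] / 2) - (b[1] + b[3] / 2))
        across, along = (dx, dy) if vertical else (dy, dx)
        # Typesetting consistency: aligned centers and a bounded spacing.
        return across <= align_tol * scale and along <= (1 + max_gap) * scale

    for i in range(len(boxes)):
        for j in range(i + 1, len(boxes)):
            if linked(boxes[i], boxes[j]):
                parent[find(j)] = find(i)

    groups: Dict[int, List[int]] = {}
    for i in range(len(boxes)):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())


if __name__ == "__main__":
    # Two columns of 20x20 characters plus one small 8x8 outlier.
    demo = [
        (100, 10, 20, 20), (100, 32, 20, 20), (100, 54, 20, 20),
        (60, 10, 20, 20), (60, 32, 20, 20),
        (100, 76, 8, 8),
    ]
    print(group_into_strings(demo, vertical=True))
```

Grouping by pairwise links lets a long string form even when its first and last characters are far apart, since each character only needs to match a neighbor. Font features for the third stage would then be computed per group, which this sketch does not attempt.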