Construction of Uyghur Scene Text Image Database
Yilihamu Aili,Yiwen Wang,Pei Liu,Alimujiang Abudiriyimu,Kurban Ubul
DOI: https://doi.org/10.1145/3497623.3497660
2021-10-15
Abstract:In recent years, the International Conference on Document Analysis and Recognition (ICDAR) has released a data set of multilingual text in natural scene images. The data set contains natural scene images in nine different languages, including Chinese, Arabic, Korean, etc. At present, it is rare to see a comprehensive data set of Uyghur scene text images in natural scene images. Therefore, the main purpose of this article is to create a new Uyghur text data set in natural scene images and provide it to the research community to develop and evaluate the latest Uyghur text recognition and detection algorithms for text retrieval and recognition. Images of natural scenes, segmented characters and composite images of scenes, and Uyghur words in the images are manually marked at the word level from the captured images. The data set includes 2381 Uyghur advertisement pictures, 17,638 video interceptions, 394 complex natural scenes that can be used for Uyghur text detection in complex scenes, and 200067 Uyghur text pictures that can be used for Uyghur text recognition. These pictures come from web crawlers, Uighur animated films and natural street scenes in Xinjiang, China. They include flat text, multi-angle text, raised 3D text, artistic text, distant text, low-light text, partially occluded text, etc. Due to the diversity of the collection environment and the complexity of the image background, the database can be used as a benchmark for Uyghur text detection and recognition in natural scene images, which poses great challenges.