An Improved Word Spotting Method for Printed Uyghur Document Image Retrieval

Eksan Firkat,Askar Hamdulla,Palidan Tuerxun,Abdusalam Dawut
DOI: https://doi.org/10.1007/978-981-32-9298-7_9
2019-01-01
Abstract:Key word spotting plays an important role in the field of Uyghur printed document retrieval. However, the cursive nature of Uyghur script causes some drawbacks for retrieving a word correctly with word spotting approach based on SIFT (Scale Invariant Feature Transform) feature. To overcome this limitation, this paper proposes a new approach by introducing the concept of considering the retrieval result of EDM (Euclidean distance mapping) matching algorithm as a geometry information by homograph matrix and perspective transformation to further improve the accuracy of word spotting. The comparative experiment of the proposed method is evaluated on a dataset of 190 Uyghur printed document images which contain about 17648 words. Experimental result demonstrates that the proposed approach is an effective method of retrieving the word comparing with the previous word spotting method used in Uyghur printed document image.
What problem does this paper attempt to address?