Approach to matching partial word image and its application to document image retrieval

yue lu,chew lim tan,lin lin
DOI: https://doi.org/10.1117/12.483240
2002-01-01
Abstract:An approach with the capability of matching partial word image is proposed in this paper, to facilitate the issues of document image retrieval, such as detection of user-specified query words, and similarity measurement between documents. Each word image is represented by a feature string. Then, an inexact string matching technology is utilized to measure the similarity between the two feature strings generated from two word images, based on which we can estimate how one word image is relevant to the other one and thereby decide whether one is a portion of the other word. The approach is applied to two issues in the area of document information retrieval: word spotting and document similarity measurement. Experimental results on real document images show that it is a promising approach.
What problem does this paper attempt to address?