Combining image and text features: a hybrid approach to mobile book spine recognition.

Sam S. Tsai,David M. Chen,Huizhong Chen,Cheng-Hsin Hsu,Kyu-Han Kim,Jatinder Pal Singh,Bernd Girod
DOI: https://doi.org/10.1145/2072298.2071930
2011-01-01
Abstract:Despite the successful use of local image features for large-scale object recognition, they are not effective in recognizing book spines on bookshelves. This is because some book spines contain only text components that do not yield distinguishing image features. To overcome this issue, we develop a new approach that combines a text-based spine recognition pipeline with an image feature-based spine recognition pipeline. The text within the book spine image is recognized and used as keywords to search a book spine text database. The image features of the book spine image are searched through a book spine image database. The search results of the two approaches are then carefully combined to form the final result. We implement the proposed hybrid book recognition pipeline used in a book inventory management system, and conduct extensive experiments to evaluate its performance. The experimental results show that while text-based or image feature-based systems only achieve a recall of 72%, the proposed hybrid system achieves a recall of ~91%.
What problem does this paper attempt to address?