Abstract: Maintaining an accurate and up-to-date inventory of one's assets is a labor-intensive, tedious, and costly operation. To ease this difficult but important task, we design and implement a mobile asset tracking system that automatically generates an inventory from photos of the assets snapped with a smartphone. Since smartphones are becoming ubiquitous, construction and deployment of our inventory management solution are simple and cost-effective. Automatic asset recognition is achieved by first segmenting individual assets out of the query photo and then performing bag-of-visual-features (BoVF) image matching on the segmented regions. The smartphone's sensor readings, such as digital compass and accelerometer measurements, can be used to determine the location of each asset, and this location information is stored in the inventory for each recognized asset.

As a special case study, we demonstrate a mobile book tracking system, where users snap photos of books stacked on bookshelves to generate a location-aware book inventory. We show that segmenting the book spines is very important for accurate feature-based image matching against a database of book spines. Segmentation also provides the exact orientation of each book spine, so more discriminative upright local features can be employed for improved recognition. The system's mobile client has been implemented for smartphones running the Symbian or Android operating systems. The client enables a user to snap a picture of a bookshelf and subsequently view the recognized spines in the smartphone's viewfinder. Two pose estimates, one from BoVF geometric matching and the other from segmentation boundaries, are both used to accurately draw the boundary of each spine in the viewfinder for easy visualization. The BoVF representation also allows matching each photo of a bookshelf rack against a photo of the entire bookshelf, and the resulting feature matches are used in conjunction with the smartphone's orientation sensors to determine the exact location of each book.
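The BoVF matching step mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a tiny flat codebook, 2-D toy descriptors, and plain cosine similarity, and it omits the segmentation, upright local features, and geometric verification that the abstract describes. All function names (`quantize`, `bovf_histogram`, `similarity`, `best_match`) are hypothetical.

```python
import math
from collections import Counter


def quantize(descriptor, codebook):
    """Return the index of the nearest visual word (squared Euclidean distance)."""
    return min(
        range(len(codebook)),
        key=lambda i: sum((d - c) ** 2 for d, c in zip(descriptor, codebook[i])),
    )


def bovf_histogram(descriptors, codebook):
    """Build an L2-normalized sparse histogram of visual-word counts."""
    counts = Counter(quantize(d, codebook) for d in descriptors)
    norm = math.sqrt(sum(v * v for v in counts.values()))
    return {word: v / norm for word, v in counts.items()} if norm else {}


def similarity(hist_a, hist_b):
    """Cosine similarity of two L2-normalized sparse histograms."""
    return sum(v * hist_b.get(word, 0.0) for word, v in hist_a.items())


def best_match(query_descriptors, database, codebook):
    """Return the database key whose histogram is most similar to the query."""
    query_hist = bovf_histogram(query_descriptors, codebook)
    return max(database, key=lambda name: similarity(query_hist, database[name]))


# Toy data (assumed for illustration): three visual words, two database spines.
codebook = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)]
database = {
    "spine_a": bovf_histogram([(0.1, 0.0), (0.9, 0.1), (1.0, 0.0)], codebook),
    "spine_b": bovf_histogram([(0.0, 0.9), (0.1, 1.0)], codebook),
}
query = [(0.0, 0.1), (1.1, 0.0), (0.9, 0.0)]  # descriptors from one segmented region
print(best_match(query, database, codebook))
```

In this sketch each segmented spine region would be matched separately, which reflects the abstract's point that segmentation keeps features from neighboring spines out of the query histogram.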
