Abstract:Along with the ever-growing computational power of mobile devices, mobile visual search has undergone an evolution in techniques and applications. A significant trend is low bit rate visual search, where compact visual descriptors are extracted directly over a mobile and delivered as queries rather than raw images to reduce the query transmission latency. In this article, we introduce our work on low bit rate mobile landmark search, in which a compact yet discriminative landmark image descriptor is extracted by using a location context such as GPS, crowd-sourced hotspot WLAN, and cell tower locations. The compactness originates from the bag-of-words image representation, with off line learning from geotagged photos from online photo-sharing websites including Flickr and Panoramio. The learning process involves segmenting the landmark photo collection by discrete geographical regions using a Gaussian mixture model and then boosting a ranking-sensitive vocabulary within each region, with "entropy"-based feedback on the compactness of the descriptor to refine both phases iteratively. In online search, when entering a geographical region, the code book in a mobile device is downstream adapted to generate extremely compact descriptors with promising discriminative ability. We have deployed landmark search apps to both HTC and iPhone mobile phones, accessing a database of a million scale images in typical areas like Beijing, New York, and Barcelona, and others. Our descriptor outperforms alternative compact descriptors (Chen et al. 2009; Chen et al., 2010; Chandrasekhar et al. 2009a; Chandrasekhar et al. 2009b) by significant margins. Beyond landmark search, this article will summarize the MPEG standarization progress of compact descriptor for visual search (CDVS) (Yuri et al. 2010; Yuri et al. 2011) toward application interoperability.

Training dataset Preprocessing SIFT descriptor SMPT Grassman Pruning Entropy encoder SIFT descriptor Training Bit Stream

Mobile Visual Search Compression with Grassmann Manifold Embedding

KPB-SIFT

Edge-SIFT: Discriminative Binary Descriptor for Scalable Partial-Duplicate Mobile Search.

Medical Image Retrieval Using Sift Feature

Data-Driven Lightweight Interest Point Selection for Large-Scale Visual Search

Multi-stage vector quantization towards low bit rate visual search

Visual word expansion and BSIFT verification for large-scale image search

A Comparative Study of SIFT and Its Variants

Geometric Context-Preserving Progressive Transmission in Mobile Visual Search

Scalable Object Retrieval with Compact Image Representation from Generic Object Regions

On The Interoperability Of Local Descriptors Compression

Visual Query Compression With Locality Preserving Projection On Grassmann Manifold

Pruning Tree-Structured Vector Quantizer Towards Low Bit Rate Mobile Visual Search

PQ-WGLOH: A bit-rate scalable local feature descriptor

Optimizing JPEG Quantization Table for Low Bit Rate Mobile Visual Search

Depth-based Local Feature Selection for Mobile Visual Search

Learning Compact Visual Descriptors For Low Bit Rate Mobile Landmark Search

USB: Ultrashort Binary Descriptor for Fast Visual Matching and Retrieval

Image Retargeting for Preserving Robust Local Feature: Application to Mobile Visual Search

Smart Query Expansion Scheme for CDVS Based on Illumination and Key Features