Abstract:Along with the ever-growing computational power of mobile devices, mobile visual search has undergone an evolution in techniques and applications. A significant trend is low bit rate visual search, where compact visual descriptors are extracted directly over a mobile and delivered as queries rather than raw images to reduce the query transmission latency. In this article, we introduce our work on low bit rate mobile landmark search, in which a compact yet discriminative landmark image descriptor is extracted by using a location context such as GPS, crowd-sourced hotspot WLAN, and cell tower locations. The compactness originates from the bag-of-words image representation, with off line learning from geotagged photos from online photo-sharing websites including Flickr and Panoramio. The learning process involves segmenting the landmark photo collection by discrete geographical regions using a Gaussian mixture model and then boosting a ranking-sensitive vocabulary within each region, with "entropy"-based feedback on the compactness of the descriptor to refine both phases iteratively. In online search, when entering a geographical region, the code book in a mobile device is downstream adapted to generate extremely compact descriptors with promising discriminative ability. We have deployed landmark search apps to both HTC and iPhone mobile phones, accessing a database of a million scale images in typical areas like Beijing, New York, and Barcelona, and others. Our descriptor outperforms alternative compact descriptors (Chen et al. 2009; Chen et al., 2010; Chandrasekhar et al. 2009a; Chandrasekhar et al. 2009b) by significant margins. Beyond landmark search, this article will summarize the MPEG standarization progress of compact descriptor for visual search (CDVS) (Yuri et al. 2010; Yuri et al. 2011) toward application interoperability.

Enabling Low Bitrate Mobile Visual Recognition

Enabling Low Bitrate Mobile Visual Recognition: a Performance Versus Bandwidth Evaluation.

Edge Segmentation: Empowering Mobile Telemedicine with Compressed Cellular Neural Networks

Image Retargeting for Preserving Robust Local Feature: Application to Mobile Visual Search

Towards low bit rate mobile visual search with multiple-channel coding.

Low-rate image retrieval with tree histogram coding

Learning Compact Visual Descriptors For Low Bit Rate Mobile Landmark Search

Video Surveillance on Mobile Edge Networks—A Reinforcement-Learning-Based Approach

Intelligent Video Surveillance Based on Mobile Edge Networks

Sorting Local Descriptors for Lowbit Rate Mobile Visual Search

Optimizing JPEG Quantization Table for Low Bit Rate Mobile Visual Search

Multi-stage vector quantization towards low bit rate visual search

Joint Optimization of JPEG Quantization Table and Coefficient Thresholding for Low Bitrate Mobile Visual Search

Learning Compact Visual Descriptor for Low Bit Rate Mobile Landmark Search.

Beyond Visual Retargeting: A Feature Retargeting Approach for Visual Recognition and Its Applications.

Learning Salient Visual Word for Scalable Mobile Image Retrieval.

Understanding Sensor Data Using Deep Learning Methods on Resource-Constrained Edge Devices.

An Efficient and Low Power Deep Learning Framework for Image Recognition on Mobile Devices

Extreme Low Bitrate Image Compression System for Mobile Deployment

Learning Multiple Codebooks for Low Bit Rate Mobile Visual Search

Spatial Verification for Scalable Mobile Image Retrieval