Abstract: Most effective approaches to particular object and image retrieval are based on the bag-of-words (BoW) model, and state-of-the-art systems generally rely on a query expansion step, which significantly improves retrieval results. Convolutional neural networks (CNNs) are now widely applied in computer vision, including image classification, captioning, recognition, and retrieval. We introduce an extension to query expansion: an automatic method for selecting good candidate samples for interactive annotation, which are then used in query expansion with both BoW and CNN features. We cast query expansion in an active learning framework and focus on the sample selection step. Specifically, we propose an active sample selection algorithm based on binary relevance classification. It rests on the assumption that the samples most confusing to the classifier are likely to contain true positives that are helpful for query expansion, and selecting them significantly improves retrieval performance. The algorithm makes full use of the multimodal information in the shortlist returned by the basic retrieval to train a binary relevance classifier, treating the top of the list as unlabeled data and the bottom of the list as fake negatives; the classifier then picks the most confusing samples for human annotation. This yields faster and better retrieval than naive selection of the top-ranked samples. We also fuse the BoW vector and the CNN prediction in the retrieval system for better performance. We evaluate the proposed method on the standard Oxford (5K and 105K) and Paris (6K) datasets; the experimental results and a comparison with state-of-the-art methods demonstrate its effectiveness.
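The sample selection step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes that each shortlist item is already represented as a small feature vector, that the query and any previously confirmed images serve as positives, and that a plain logistic regression stands in for the binary relevance classifier. The names `train_logistic`, `predict_proba`, and `select_confusing_samples` are hypothetical. The bottom of the shortlist supplies the fake negatives, and the unlabeled top-list items whose predicted relevance is closest to 0.5 are returned for human annotation.

```python
import math


def predict_proba(w, b, x):
    """Predicted probability that feature vector x is relevant."""
    z = sum(wj * xj for wj, xj in zip(w, x)) + b
    z = max(-30.0, min(30.0, z))  # clamp to keep exp() well behaved
    return 1.0 / (1.0 + math.exp(-z))


def train_logistic(X, y, lr=0.5, epochs=200):
    """Fit a logistic-regression classifier with batch gradient descent.

    Assumption: logistic regression is a stand-in; the abstract does not
    name the classifier that is actually used.
    """
    dim = len(X[0])
    n = len(X)
    w = [0.0] * dim
    b = 0.0
    for _ in range(epochs):
        grad_w = [0.0] * dim
        grad_b = 0.0
        for xi, yi in zip(X, y):
            err = predict_proba(w, b, xi) - yi
            for j in range(dim):
                grad_w[j] += err * xi[j]
            grad_b += err
        w = [wj - lr * gj / n for wj, gj in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b


def select_confusing_samples(positives, top_list, bottom_list, budget):
    """Return indices into top_list of the `budget` most confusing samples.

    positives   -- features of the query / confirmed images (assumed input)
    top_list    -- features of the top-ranked shortlist items (unlabeled)
    bottom_list -- features of the bottom-ranked items (fake negatives)
    """
    X = positives + bottom_list
    y = [1] * len(positives) + [0] * len(bottom_list)
    w, b = train_logistic(X, y)
    # A sample is "confusing" when its predicted relevance is near 0.5.
    confusion = [
        (abs(predict_proba(w, b, x) - 0.5), idx)
        for idx, x in enumerate(top_list)
    ]
    confusion.sort()
    return [idx for _, idx in confusion[:budget]]
```

In an interactive loop, the returned samples would be shown to the annotator, and those confirmed as relevant would be added to `positives` before the next round of query expansion. By contrast, the naive baseline mentioned in the abstract would simply take the first `budget` items of `top_list`.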

Bagging to find better expansion words

Machine Learning Based Query Expansion in Blog Retrieval

Improving short text classification using public search engines

Exploiting Semantic Knowledge Base for Patent Retrieval

Expand, Rerank, and Retrieve: Query Reranking for Open-Domain Question Answering.

Query Expansion by Spatial Co-Occurrence for Image Retrieval

Improving Query Expansion Using WordNet

Concept Based Query Expansion Using Wordnet

Query Expansion for Object Retrieval with Active Learning Using BoW and CNN Feature

Query Expansion Based on Clustered Results

Selecting Expansion Terms As a Set Via Integer Linear Programming

Concept based query expansion using hidden markov model

Probabilistic Query Expansion Using Query Logs

Selecting Query-bag as Pseudo Relevance Feedback for Information-seeking Conversations

Systematic Study on Query Expansion

Improving Question Answering Based on Query Expansion with Wikipedia

Towards Unsupervised Semantic Retrieval Of Spoken Content With Query Expansion Based On Automatically Discovered Acoustic Patterns

BERT-QE: Contextualized Query Expansion for Document Re-ranking

Query expansion based on Web knowledge base and search engine

Specific Academic Area Based Automatic Query Expansion

Using WordNet in Conceptual Query Expansion