Abstract: Local search services (e.g., Yelp, Yahoo! Local) have emerged as a popular and effective paradigm for a wide range of information needs about local businesses; they now provide a viable and even more effective alternative to general-purpose web search for queries on local businesses. However, because the information needs behind local search are diverse, different query types call for different information retrieval strategies. In this paper, we explore a taxonomy of local search driven by users' information needs, which categorizes local search queries into three types: business category, chain business, and non-chain business. To decide which search strategy to use for each category in this taxonomy without placing the burden on web users, an automatic local query classifier is indispensable. However, since local search queries yield few online features and editorial labels are expensive to obtain, a supervised learning approach alone is insufficient. We address these problems by developing a semi-supervised approach that mines information needs from a vast amount of unlabeled data in local query logs to boost local query classification. Results of a large-scale evaluation over queries from a commercial local search site show that the proposed semi-supervised method allows us to accurately classify a substantially larger proportion of local queries than the supervised learning approach.
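
To make the idea of boosting a query classifier with unlabeled logs concrete, here is a minimal sketch of generic self-training over the three query types named in the abstract. It is not the method proposed in the paper, whose details the abstract does not give. The naive Bayes model, the 0.6 confidence threshold, the function names, and the toy queries are all assumptions made for illustration.

```python
import math
from collections import Counter

# The three query types from the abstract's taxonomy.
LABELS = ("business_category", "chain_business", "non_chain_business")


def train(examples):
    """Fit a multinomial naive Bayes model on (query, label) pairs."""
    word_counts = {label: Counter() for label in LABELS}
    label_counts = Counter()
    for query, label in examples:
        label_counts[label] += 1
        word_counts[label].update(query.lower().split())
    vocab = {w for counts in word_counts.values() for w in counts}
    return word_counts, label_counts, vocab


def predict(model, query):
    """Return (best_label, posterior_probability) for one query."""
    word_counts, label_counts, vocab = model
    total = sum(label_counts.values())
    log_scores = {}
    for label in LABELS:
        # Laplace smoothing on the prior and on the word likelihoods.
        score = math.log((label_counts[label] + 1) / (total + len(LABELS)))
        denom = sum(word_counts[label].values()) + len(vocab) + 1
        for word in query.lower().split():
            score += math.log((word_counts[label][word] + 1) / denom)
        log_scores[label] = score
    top = max(log_scores.values())
    weights = {l: math.exp(s - top) for l, s in log_scores.items()}
    best = max(weights, key=weights.get)
    return best, weights[best] / sum(weights.values())


def self_train(labeled, unlabeled, threshold=0.6, rounds=5):
    """Repeatedly add confidently predicted unlabeled queries to the training set."""
    labeled, pool = list(labeled), list(unlabeled)
    for _ in range(rounds):
        model = train(labeled)
        confident, rest = [], []
        for query in pool:
            label, prob = predict(model, query)
            if prob >= threshold:
                confident.append((query, label))
            else:
                rest.append(query)
        if not confident:
            break  # nothing new to learn from
        labeled.extend(confident)
        pool = rest
    return train(labeled)


# Hypothetical toy data: three editorial labels plus two unlabeled log queries.
labeled = [
    ("pizza restaurant", "business_category"),
    ("starbucks coffee", "chain_business"),
    ("joes diner", "non_chain_business"),
]
unlabeled = ["pizza restaurant downtown", "starbucks coffee drive thru"]

model = self_train(labeled, unlabeled)
print(predict(model, "drive thru"))
```

In this toy run, the words "drive" and "thru" never appear in the labeled set, so the classifier can only associate them with a query type through the pseudo-labeled log queries. That is the general intuition behind using unlabeled query logs, under the assumptions stated above.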

Semi-supervised text classification with information retrieval techniques

Improving short text classification using public search engines

Improving semi-supervised text classification by using wikipedia knowledge

Research on Deep Web Classification Based on Domain Feature Text

An Overview on Supervised Semi-structured Data Classification

A taxonomy of local search: semi-supervised query classification driven by information needs.

Cross-Domain Knowledge Transfer Using Semi-supervised Classification

Automatic Query Classification Via Constructing Semantic Lexicon

A Closeness-Based Semi-Supervised Text Classification Method

A Survey of Semi-supervised Text Categorization

Semi-supervised Learning for Image Retrieval Using Support Vector Machines

Chinese Short Text Categorization Based on Semi-Supervised Learning

Exploiting Text Content In Image Search By Semi-Supervised Learning Techniques

Data Security Search Based on Semi-Supervised Sensitive Classifier

Improving Image Retrieval with Semantic Classification Using Relevance Feedback

Multi-Modal Web Search Query Refinement Based on Semi-Supervised Learning

The Technology Research of The Semantic Text Classification

Study on Information Retrieval Mechanism Based on Semantic Web

Short Text Classification Based on Semi-Supervised Learning

Research On Effective Web Information Retrieval Based On Semantic Web

Semi-supervised document retrieval