Learning Phrase-Based Spelling Error Models from Clickthrough Data.

Xu Sun,Jianfeng Gao,Daniel Micol,Chris Quirk
2010-01-01
Abstract:This paper explores the use of clickthrough data for query spelling correction. First, large amounts of query-correction pairs are derived by analyzing users' query reformulation behavior encoded in the clickthrough data. Then, a phrase-based error model that accounts for the transformation probability between multi-term phrases is trained and integrated into a query speller system. Experiments are carried out on a human-labeled data set. Results show that the system using the phrase-based error model outperforms significantly its baseline systems.
What problem does this paper attempt to address?