Query Error Correction Algorithm Based On Fusion Sequence To Sequence Model

Jianyong Duan,Tianxiao Ji,Mingli Wu,Hao Wang
DOI: https://doi.org/10.1007/978-3-030-28374-2_2
2019-01-01
Abstract:The query error correction task is very important to improve user satisfaction and quality of query results. In traditional query error correction methods researchers mostly use a pipeline way to correct the error step by step. They rely heavily on manual annotation corpora. It is difficult to take into account the global effect. In this paper, we present a character-based end-to-end Sequence to Sequence (Seq2Seq) method with attention mechanism. It also incorporates the neural network language model trained on unlabeled corpora to solve the task of query correction. It can unify the modeling of different error types in query error correction and effectively overcome the shortcomings of traditional methods in query error correction tasks. Experiments show that this method can effectively capture the long-distance knowledge to correct errors, and through the Simple Recurrent Unit (SRU) it can be as good as Long Short-Term Memory (LSTM). However, there has been a significant improvement in processing time. This point is very important in query error correction tasks.
What problem does this paper attempt to address?