DocChat: An Information Retrieval Approach for Chatbot Engines Using Unstructured Documents

Zhao Yan, Nan Duan, Junwei Bao, Peng Chen, Ming Zhou, Zhoujun Li, Jianshe Zhou
DOI: https://doi.org/10.18653/v1/p16-1049
2016-01-01
Abstract: Most current chatbot engines are designed to reply to user utterances based on existing utterance-response (or Q-R) pairs. In this paper, we present DocChat, a novel information retrieval approach for chatbot engines that can leverage unstructured documents, instead of Q-R pairs, to respond to utterances. A learning to rank model with features designed at different levels of granularity is proposed to measure the relevance between utterances and responses directly. We evaluate our proposed approach in both English and Chinese: (i) For English, we evaluate DocChat on WikiQA and QASent, two answer sentence selection tasks, and compare it with state-of-the-art methods. Reasonable improvements and good adaptability are observed. (ii) For Chinese, we compare DocChat with XiaoIce, a famous chitchat engine in China, and side-by-side evaluation shows that DocChat is a perfect complement for chatbot engines using Q-R pairs as main source of responses.