ConvTrans: Transforming Web Search Sessions for Conversational Dense Retrieval.

Kelong Mao, Zhicheng Dou, Hongjin Qian, Fengran Mo, Xiaohua Cheng, Zhao Cao
DOI: https://doi.org/10.18653/v1/2022.emnlp-main.190
2022-01-01
Abstract: Conversational search provides users with a natural and convenient new search experience. Recently, conversational dense retrieval has been shown to be a promising technique for realizing conversational search. However, because conversational search systems have not been widely deployed, it is hard to obtain large-scale real conversational search sessions and relevance labels to support the training of conversational dense retrieval. To tackle this data scarcity problem, previous methods focus on developing better few-shot learning approaches or generating pseudo relevance labels, but the data they use for training still relies heavily on manual generation. In this paper, we present ConvTrans, a data augmentation method that automatically transforms easily accessible web search sessions into conversational search sessions to fundamentally alleviate the data scarcity problem for conversational dense retrieval. ConvTrans eliminates the gaps between these two types of sessions in terms of session quality and query form to achieve effective session transformation. Extensive evaluations on two widely used conversational search benchmarks demonstrate that a model trained on the data generated by ConvTrans achieves retrieval performance comparable to that of the same model trained on high-quality but expensive artificial conversational search data.