Using sparse coding for answer summarization in non-factoid community question-answering

Zhaochun Ren,Hongya Song,Piji Li,Shangsong Liang,Jun Ma,Maarten de Rijke
2016-01-01
Abstract:We focus on the task of summarizing answers in community question-answering (CQA). While most previous work on answer summarization focuses on factoid question-answering, we focus on nonfactoid question-answering. In contrast to factoid CQA with a short and accurate answer, non-factoid question-answering usually requires passages as answers. The diversity, shortness and sparseness of answers form interesting challenges for summarization. To tackle these challenges, we propose a sparse coding-based summarization strategy, in which we can effectively capture the saliency of diverse, short and sparse units. Specifically, after transferring all candidate answer sentences into vectors, we present a coordinate descent learning method to optimize a loss function to reconstruct the input vectors as a linear combination of basis vectors. Experimental results on a benchmark data collection confirm the effectiveness of our proposed method in non-factoid CQA summarization. Our method is shown to significantly outperform the state-of-theart in terms of ROUGE metrics.
What problem does this paper attempt to address?