Survey of Query-based Text Summarization

Hang Yu, Jiawei Han
2024-10-07
Abstract: Query-based text summarization is an important real-world problem that requires condensing prolix text data into a summary under the guidance of query information provided by users. The topic has been studied for a long time, and there is much interesting existing research related to query-based text summarization. Yet much of this work has not been systematically surveyed. This survey aims to summarize some interesting work on query-based text summarization methods as well as related generic text summarization methods. To the best of our knowledge, not all taxonomies in this paper exist in the related work, and some analysis will be presented.
Information Retrieval, Computation and Language
What problem does this paper attempt to address?
This paper aims to address the problem of query-based text summarization. Specifically:

- **Research Background and Motivation**: With the development of the internet, information spreads faster, and the vast amount of text data makes it difficult for users to filter out useful information. Traditional automatic text summarization methods can summarize documents, but in practice users are often interested only in specific aspects of a document, which may not be its main parts. Relying solely on generic text summarization methods may therefore produce summaries that omit the information the user cares about.
- **Main Objective**: Query-based text summarization uses user-provided query information to guide the automated summarization process, so that the generated summary includes the content of interest to the user. The goal of the paper is to systematically summarize the related research on query-based text summarization and to explore how to better use query information to improve the quality and relevance of summaries.
- **Research Scope**: The paper covers query-based extractive summarization and query-based abstractive summarization, and reviews related unsupervised and supervised learning methods. It also discusses evaluation methods, such as the ROUGE metric, and technological advancements.

In summary, this paper aims to fill the current research gap in the field of query-based text summarization and to provide a comprehensive reference framework for subsequent researchers.
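To make the idea of query-guided extraction and ROUGE-style evaluation concrete, here is a minimal sketch. It is not a method taken from the survey: the function names (`tokenize`, `query_extractive_summary`, `rouge1_recall`), the word-overlap scoring rule, and the example text are all illustrative assumptions. The scorer ranks sentences by the fraction of their tokens that appear in the query, and the evaluation function computes a simplified unigram recall in the spirit of ROUGE-1, without stemming or stopword handling.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def query_extractive_summary(document, query, k=1):
    """Return the k sentences with the highest query-term density.

    Assumption: relevance is the fraction of a sentence's tokens that
    also occur in the query. Real systems use richer relevance models.
    """
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", document) if s.strip()]
    query_terms = set(tokenize(query))
    scored = []
    for idx, sentence in enumerate(sentences):
        tokens = tokenize(sentence)
        overlap = sum(1 for t in tokens if t in query_terms)
        score = overlap / len(tokens) if tokens else 0.0
        scored.append((score, idx, sentence))
    # Highest score first; ties are broken by original position.
    top = sorted(scored, key=lambda item: (-item[0], item[1]))[:k]
    # Emit the selected sentences in document order.
    return [sentence for _, _, sentence in sorted(top, key=lambda item: item[1])]


def rouge1_recall(candidate, reference):
    """Simplified ROUGE-1 recall: clipped unigram overlap / reference length."""
    cand_counts = Counter(tokenize(candidate))
    ref_counts = Counter(tokenize(reference))
    total = sum(ref_counts.values())
    if total == 0:
        return 0.0
    overlap = sum(min(count, cand_counts[tok]) for tok, count in ref_counts.items())
    return overlap / total


if __name__ == "__main__":
    doc = (
        "The city opened a new library. "
        "The library budget was cut by ten percent. "
        "Residents enjoyed the summer festival."
    )
    summary = query_extractive_summary(doc, "library budget", k=1)
    print(summary)
    print(rouge1_recall(" ".join(summary), "The library budget was cut."))
```

A generic summarizer might pick the first sentence as the most central one, whereas the query terms steer this sketch toward the sentence about the budget, which is the behavior the summary above describes.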