Alleviating Hallucinations Via Supportive Window Indexing in Abstractive Summarization

Jiaxin Duan,Fengyu Lu,Junfei Liu
DOI: https://doi.org/10.1109/icassp48485.2024.10446022
2024-01-01
Abstract:Abstractive summarization models learned with maximum likelihood estimation (MLE) have been proven to produce hallucinatory content, which heavily limits their real-world applicability. Preceding studies attribute this problem to the semantic insensitivity of MLE, and they compensate for it with additional unsupervised learning objectives that maximize the metrics of document-summary inferring, however, resulting in unstable and expensive model training. In this paper, we propose a novel supportive windows indexing grounded summarization (SWIGS) paradigm, where an input document is split into several windows, and a summarization model orderly generates the indices of supportive windows before each summary sentence. Because the supportive windows locate the source information closely related to the summary sentence to be generated, pointing out their indices at first helps to ground the evidence-based summary generation, thus alleviating groundless hallucinations. We create only supervised objectives to learn the SWIGS model and conduct extensive experiments on two well-known datasets to validate its effectiveness. Results vouch for the superiority of SWIGS as it outperforms previous methods regarding multiple metrics.
What problem does this paper attempt to address?