Efficient Two-stage Approach for Long Document Summarization

Stanford CS224N Custom,Ling Sui,Jialuo Yuan,Fengming Tang,Elaine Yi,Benson Zu
Abstract:Long document summarization is essential in NLP, but state-of-the-art models like BART and BERT face limitations summarizing long documents. Current methods like Longformer Encoder Decoder (LED) can handle longer document summarization but suffers from slower processing times due to their complex attention mechanisms. To address this challenge, we propose a two-stage approach that combines sentence extraction algorithms with BART for generating abstractive summaries. Our approach leverages the efficiency of extraction algorithms to identify key sentences from the input document. BART then generates more coherent and informative abstractive summaries from these extracted sentences. Our experimental results show that our approach is four times more time-efficient than the LED baseline while processing the same amount of data, with approximately the same performance in terms of Rouge 1,2, and L F-measure. With the hope of further improving the generated summaries, we also use Generative Adversarial Network to train the model. Our proposed approach has important implications for NLP applications that require summarization of long text, such as legal documents or scientific papers.
Computer Science
What problem does this paper attempt to address?