Abstract: Data-driven approaches to sequence-to-sequence modelling have been successfully applied to short text summarization of news articles. Such models are typically trained on input-summary pairs consisting of only a single or a few sentences, partially due to limited availability of multi-sentence training data. Here, we propose to use scientific articles as a new milestone for text summarization: large-scale training data come almost for free with two types of high-quality summaries at different levels - the title and the abstract. We generate two novel multi-sentence summarization datasets from scientific articles and test the suitability of a wide range of existing extractive and abstractive neural network-based summarization approaches. Our analysis demonstrates that scientific papers are suitable for data-driven text summarization. Our results could serve as valuable benchmarks for scaling sequence-to-sequence models to very long sequences.

What problem does this paper attempt to address?

The main question this paper addresses is whether scientific articles can serve as a new benchmark for data-driven text summarization. Specifically, the paper introduces two new multi-sentence summarization datasets built from scientific articles and tests a range of existing extractive and abstractive neural network summarization methods on them. In doing so, it aims to evaluate whether scientific articles are suitable for data-driven text summarization and to provide a benchmark for scaling sequence-to-sequence models to very long sequences. The paper focuses on the following points:

1. **Scientific articles as a new source of summarization data**: The paper proposes scientific articles as a new milestone for data-driven text summarization, because they come with high-quality abstracts and titles that can be used as training data.
2. **Constructing new datasets**: The paper constructs two new large-scale multi-sentence summarization datasets:
   - **title-gen**: 5 million title-abstract pairs from the biomedical field.
   - **abstract-gen**: 900,000 abstract-body pairs.
3. **Evaluating existing methods**: The paper evaluates a series of existing extractive and abstractive neural network summarization methods on these new datasets, including extractive methods based on word embeddings and abstractive methods based on Recurrent Neural Networks (RNN) and Convolutional Neural Networks (CNN).
4. **Performance analysis**: The paper analyzes the outputs of these models both quantitatively and qualitatively, with particular attention to how the models perform on long input/output sequence pairs.

Overall, the goal of this paper is to promote research in the field of scientific article summarization and provide a basis for developing new models that can efficiently handle long input and output sequences.
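
As a rough illustration of how the two datasets described above could be assembled, the following Python sketch turns article records into input-summary pairs. It is not the authors' pipeline: the `Article` record, the function names, and the pairing direction (abstract as input with title as target for title-gen; body as input with abstract as target for abstract-gen) are assumptions made for this example.

```python
from dataclasses import dataclass
from typing import Iterable, Iterator, Tuple


@dataclass
class Article:
    """Hypothetical article record; field names are assumptions."""
    title: str
    abstract: str
    body: str = ""


def title_gen_pairs(articles: Iterable[Article]) -> Iterator[Tuple[str, str]]:
    """Yield (abstract, title) pairs, skipping records missing either field."""
    for article in articles:
        title, abstract = article.title.strip(), article.abstract.strip()
        if title and abstract:
            yield abstract, title


def abstract_gen_pairs(articles: Iterable[Article]) -> Iterator[Tuple[str, str]]:
    """Yield (body, abstract) pairs, skipping records missing either field."""
    for article in articles:
        abstract, body = article.abstract.strip(), article.body.strip()
        if abstract and body:
            yield body, abstract
```

An article with only a title and an abstract contributes to title-gen but not to abstract-gen, which would be one plausible reason for the two datasets to differ in size.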

Data-driven Summarization of Scientific Articles