A Memory-Based Sentence Split and Rephrase Model with Multi-task Training

Xiaoning Fan,Yiding Liu,Gongshen Liu,Bo Su
DOI: https://doi.org/10.1007/978-3-030-63830-6_54
2020-01-01
Abstract:The task of sentence split and rephrase refers to breaking down a complex sentence into some simple sentences with the same semantic information, which is a basic preprocess method for simplification in many natural language processing (NLP) fields. Previous works mainly focus on applying conventional sequence-to-sequence models into this task, which fails to capture relations between entities and lacks memory of the decoded parts, and thus causes duplication of generated subsequences and confuses the relationship between subjects and objects. In this paper, we introduce a memory-based Transformer model with multi-task training to improve the accuracy of the sentence information obtained by the encoder. To enrich the semantic representation of the model, we further incorporated a conditional Variational Autoencoder (VAE) component to our model. Through experiments on the WebSplit-v1.0 benchmark dataset, results show that our proposed model outperforms other state-of-the-art baselines from both BLEU and human evaluations.
What problem does this paper attempt to address?