Multi-lingual Wikipedia Summarization and Title Generation on Low Resource Corpus

Wei Liu, Lei Li, Zuying Huang, Yinan Liu
DOI: https://doi.org/10.26615/978-954-452-058-8_004
2019-01-01
Abstract: The MultiLing 2019 Headline Generation Task on the Wikipedia corpus raised a critical and practical problem: multilingual tasks on low-resource corpora. In this paper we propose a Quality-Diversity Automatic Summarization (QDAS) model enhanced by sentence2vec, and we attempt to apply transfer learning based on a large multilingual pre-trained language model to the Wikipedia Headline Generation task. We treat headline generation as a sequence labeling task and develop two schemes to handle it. Experimental results show that a large pre-trained model can effectively use its learned knowledge to extract certain phrases from low-resource supervised data.