WikiMulti: a Corpus for Cross-Lingual Summarization

Pavel Tikhonov, Valentin Malykh
DOI: https://doi.org/10.48550/arXiv.2204.11104
2022-04-24
Abstract: Cross-lingual summarization (CLS) is the task of producing a summary in one language for a source document written in a different language. We introduce WikiMulti, a new dataset for cross-lingual summarization based on Wikipedia articles in 15 languages. As a set of baselines for further studies, we evaluate the performance of existing cross-lingual abstractive summarization methods on our dataset. The dataset is publicly available at https://github.com/tikhonovpavel/wikimulti
Computation and Language