The Cross-lingual Conversation Summarization Challenge

Yulong Chen,Ming Zhong,Xuefeng Bai,Naihao Deng,Jing Li,Xianchao Zhu,Yue Zhang
DOI: https://doi.org/10.48550/arXiv.2205.00379
2022-05-03
Abstract:We propose the shared task of cross-lingual conversation summarization, \emph{ConvSumX Challenge}, opening new avenues for researchers to investigate solutions that integrate conversation summarization and machine translation. This task can be particularly useful due to the emergence of online meetings and conferences. We construct a new benchmark, covering 2 real-world scenarios and 3 language directions, including a low-resource language. We hope that \emph{ConvSumX} can motivate researches to go beyond English and break the barrier for non-English speakers to benefit from recent advances of conversation summarization.
Computation and Language
What problem does this paper attempt to address?
This paper aims to solve the problem of cross - lingual conversation summarization. Specifically, the author proposes a new task named ConvSumX Challenge, which requires the system to be able to receive conversation texts in one language as input and generate summaries in another language as output. This task is particularly useful because with the increase of online meetings and international conferences, non - native English speakers need to be able to understand and participate in the content of these meetings. By constructing a new benchmark dataset covering two real - life scenarios and three language directions (including low - resource languages), the ConvSumX Challenge hopes to inspire researchers to develop conversation summarization technologies beyond English and break the barrier that non - English speakers have difficulty benefiting from the latest conversation summarization research. The main contributions of the paper are as follows: 1. **Proposing a new task**: Defining the task of cross - lingual conversation summarization, which is an important extension of the existing conversation summarization research. 2. **Constructing a new dataset**: Creating a benchmark dataset with multiple language directions, especially covering low - resource languages. 3. **Promoting technological development**: Hoping to promote the development of conversation summarization and machine translation technologies through this challenge, especially in dealing with cross - language data. 4. **Practical application value**: Emphasizing the importance of this task in practical application scenarios such as international conferences and online education, especially during the epidemic when international exchanges mainly rely on online platforms. In conclusion, the goal of this paper is to promote the development of cross - lingual conversation summarization technologies through proposing the ConvSumX Challenge, so that non - English speakers can also benefit from these technologies.