CSFQGD: Chinese Sentence Fill-in-the-blank Question Generation Dataset for Examination

Tianlin Zhang,Zhenyu Cui,Jiaxu Leng,Ying Liu
DOI: https://doi.org/10.1109/CSCWD49262.2021.9437824
2021-01-01
Abstract:Fill-in-the-blank question generation has become enormously popular and attracted lots of attention recently. However, most of the existing question generation datasets are developed for machine reading comprehension, which are not specifically d esigned f or e xamination. To fi ll in th e ga p, in this paper, we propose a Chinese sentence fill-in-the-blank question generation dataset for examination (named CSFQGD), which will be released to the public (1). The dataset is composed of 20.5K questions from many real examinations in Chinese that cover a wide spectrum of learning subjects. Based on the proposed dataset, we test several well-known methods for fill-in-the-blank question generation and compare their performance. Our baseline study on this dataset shows that CSFQGD is a challenging test bed for further research.
What problem does this paper attempt to address?