Quality Evaluation of Public NLP Dataset

WANG Chengwen,DONG Qingxiu,SUI Zhifang,ZHAN Weidong,CHANG Baobao,WANG Haitao
DOI: https://doi.org/10.3969/j.issn.1003-0077.2023.02.003
2023-01-01
Abstract:Pubic NLP datasets form the bedrock for NLP evaluation tasks, and the quality of such datasets has a fundamental impact on the development of evaluation tasks and the application of evaluation metrics. In this paper, we analyze and summarize eight types of problems relating to publicly available mainstream Natural Language Processing(NLP) datasets. Inspired by the quality assessment of testing in education community, we propose a series of evaluation metrics and evaluation methods combining computational and operational approaches, with the aim of providing a reference for the construction, selection and utilization of natural language processing datasets.
What problem does this paper attempt to address?