Research on Quality Evaluation of Chinese Spatial Semantic Understanding Evaluation Dataset

YUE Pengxue, WANG Chengwen, SUN Chunhui, ZHAN Weidong, SUI Zhifang
DOI: https://doi.org/10.16499/j.cnki.1003-5397.2023.01.006
IF: 3.6
2023-01-01
Applied Linguistics
Abstract: The Chinese Spatial Semantic Comprehension Ability Evaluation (SpaCE2021) can be regarded as an important attempt to evaluate human-like language ability in machines. We generate a large corpus rich in spatial meaning by replacing words that carry spatial orientation meanings, and from it construct a Chinese spatial semantic understanding evaluation dataset. In this paper, we analyze the characteristics of the dataset from four aspects: the original sentences from which the test samples were generated, the structure types of the replaced words and the replacement words, the spatial orientation types of the test set samples, and the distribution of correct and incorrect samples. Then, by comparing human performance with that of the participating machine systems, we analyze in detail how machines perform on different types of spatial vocabulary and summarize general patterns in machine spatial semantic understanding. Finally, we make specific recommendations for building a high-quality Chinese spatial semantic evaluation dataset. This work helps improve the quality of spatial semantic evaluation datasets, and thereby the accuracy and reliability of related evaluation tasks.
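The replacement step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the replacement table `SPATIAL_SWAPS`, the function `generate_variants`, and the example sentence are all hypothetical, chosen only to show how swapping one spatial word at a time yields candidate test samples.

```python
# Hypothetical replacement table (an assumption, not from the paper):
# each spatial word maps to the candidate words that may replace it.
SPATIAL_SWAPS = {
    "上": ["下", "里"],
    "前": ["后"],
}


def generate_variants(sentence, swaps=SPATIAL_SWAPS):
    """Return (variant, position, original_word, new_word) tuples.

    Each variant replaces exactly one occurrence of one spatial word,
    so every generated sample differs from the source in a single place.
    """
    variants = []
    for word, candidates in swaps.items():
        start = sentence.find(word)
        while start != -1:
            for new_word in candidates:
                variant = sentence[:start] + new_word + sentence[start + len(word):]
                variants.append((variant, start, word, new_word))
            # Look for the next occurrence of the same spatial word.
            start = sentence.find(word, start + 1)
    return variants


variants = generate_variants("书在桌子上")
for variant, position, old, new in variants:
    print(position, old, "->", new, variant)
```

A sketch like this only produces candidates. Whether a given variant is still a semantically acceptable sentence is a separate judgment, which is presumably where the correct and incorrect sample labels analyzed in the paper come from.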