SemEval-2021 Task 1: Lexical Complexity Prediction

Matthew Shardlow,Richard Evans,Gustavo Henrique Paetzold,Marcos Zampieri
DOI: https://doi.org/10.48550/arXiv.2106.00473
2021-06-01
Abstract:This paper presents the results and main findings of SemEval-2021 Task 1 - Lexical Complexity Prediction. We provided participants with an augmented version of the CompLex Corpus (Shardlow et al 2020). CompLex is an English multi-domain corpus in which words and multi-word expressions (MWEs) were annotated with respect to their complexity using a five point Likert scale. SemEval-2021 Task 1 featured two Sub-tasks: Sub-task 1 focused on single words and Sub-task 2 focused on MWEs. The competition attracted 198 teams in total, of which 54 teams submitted official runs on the test data to Sub-task 1 and 37 to Sub-task 2.
Computation and Language
What problem does this paper attempt to address?