Sparse Probability of Agreement

Jeppe Nørregaard,Leon Derczynski
DOI: https://doi.org/10.48550/arXiv.2208.06161
2022-08-12
Computation and Language
Abstract:Measuring inter-annotator agreement is important for annotation tasks, but many metrics require a fully-annotated set of data, where all annotators annotate all samples. We define Sparse Probability of Agreement, SPA, which estimates the probability of agreement when not all annotator-item-pairs are available. We show that under certain conditions, SPA is an unbiased estimator, and we provide multiple weighing schemes for handling data with various degrees of annotation.
What problem does this paper attempt to address?