Corpus and Models for Lemmatisation and POS-tagging of Old French

Jean-Baptiste Camps, Thibault Clérice, Frédéric Duval, Lucence Ing, Naomi Kanaoka, Ariane Pinche
DOI: https://doi.org/10.48550/arXiv.2109.11442
2021-09-23
Computation and Language
Abstract: Old French is a typical example of an under-resourced historical language, one that also displays a considerable amount of linguistic variation. In this paper, we present the current results of a long-running project (2015-...) and describe how we approached the difficult question of providing lemmatisation and POS models for Old French with the help of neural taggers and the progressive constitution of dedicated corpora.