Error Analysis of Uyghur Name Tagging: Language-specific Techniques and Remaining Challenges.

Abudukelimu Halidanmu,Abulizi Abudoukelimu,Boliang Zhang,Xiaoman Pan,Di Lu,Heng Ji,Yang Liu
2018-01-01
Language Resources and Evaluation
Abstract:Regardless of numerous efforts at name tagging for Uyghur, there is limited understanding on the performance ceiling. In this paper, we take a close look at the successful cases and perform careful analysis on the remaining errors of a state-of-the-art Uyghur name tagger, systematically categorize challenges, and propose possible solutions. We conclude that simply adopting a machine learning model which is proven successful for high-resource languages along with language-independent superficial features is unlikely to be effective for Uyghur, or low-resource languages in general. Further advancement requires exploiting rich language-specific knowledge and non-traditional linguistic resources, and novel methods to encode them into machine learning frameworks.
What problem does this paper attempt to address?