Named Entity Recognition Based on Bilingual Co-training.

Yegang Li,Heyan Huang,Xingjian Zhao,Shumin Shi
DOI: https://doi.org/10.1007/978-3-642-45185-0_50
2013-01-01
Abstract:Named entity recognition (NER) is a very important task in natural language processing (NLP). In this paper we present a semi-supervised approach to extract bilingual named entity, starting from a bilingual corpus where the named entities are extracted independently for each language. Then a bilingual co-training algorithm is used to improve the named entity annotation quality, and iterative process is applied to extract named entity pairs with higher bilingual conformity ratio. This leads to a significant improvement of the monolingual named entity annotation quality for both languages. Experimental result shows that the annotation quality of Chinese NE is improved from 87.17 to 88.28, and improved 80.37 to 81.76 of English NE in F-measure. © 2013 Springer-Verlag.
What problem does this paper attempt to address?