An Approach to Extract Named Entity Translingual Equivalence

陈怀兴,尹存燕,陈家骏
DOI: https://doi.org/10.3969/j.issn.1003-0077.2008.04.009
2008-01-01
Abstract: Identifying translingual equivalents of named entities is essential to multilingual natural language processing. Several approaches to named entity translation, such as bilingual dictionary lookup, word or sub-word translation, and transliteration, have been explored in past years. Another promising approach is to extract named entity translingual equivalents automatically from a parallel corpus, which usually requires the named entities to be annotated, manually or automatically, in both languages. In this paper, we propose a new approach that extracts named entity equivalents from a parallel corpus using only source-language annotation and the result of HMM word alignment. The experiment is carried out on a Chinese-English parallel corpus, with Chinese as the source language and English as the target language. The results show that the new approach yields high-quality named entity pairs with relatively high precision, even when the word alignment result is only partially correct.
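The abstract does not describe the extraction procedure itself, so the following is only a minimal sketch of the general idea, assuming a simple projection heuristic: each annotated source-language entity span is mapped through the word alignment links, and the covering window of aligned target tokens is taken as the candidate equivalent. The function name `project_entity`, the alignment representation (a set of source-index/target-index pairs), and the example sentence pair are all illustrative assumptions, not the authors' method or data.

```python
from typing import List, Set, Tuple


def project_entity(
    src_span: Tuple[int, int],
    alignment: Set[Tuple[int, int]],
    tgt_tokens: List[str],
) -> str:
    """Project a source entity span [start, end) onto the target sentence.

    Hypothetical heuristic: collect every target position linked to a
    source token inside the span, then return the contiguous target
    window from the smallest to the largest linked position.
    Returns an empty string if no token in the span is aligned.
    """
    start, end = src_span
    tgt_positions = sorted(j for (i, j) in alignment if start <= i < end)
    if not tgt_positions:
        return ""
    lo, hi = tgt_positions[0], tgt_positions[-1]
    return " ".join(tgt_tokens[lo:hi + 1])


# Illustrative (made-up) sentence pair and alignment links (src_idx, tgt_idx).
src_tokens = ["江泽民", "主席", "访问", "美国"]
tgt_tokens = ["President", "Jiang", "Zemin", "visited", "the", "United", "States"]
alignment = {(0, 1), (0, 2), (1, 0), (2, 3), (3, 5), (3, 6)}

# Source-side entity annotations as [start, end) token spans.
person = project_entity((0, 1), alignment, tgt_tokens)
location = project_entity((3, 4), alignment, tgt_tokens)
```

Taking the covering window rather than only the linked tokens is one way such a heuristic could tolerate alignments that miss a word in the middle of an entity; whether the paper uses this or a different strategy is not stated in the abstract.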