Unsupervised translation disambiguation based on mining Web relatedness of bilingual words

Liu Pengyuan, Zhao Tiejun
DOI: https://doi.org/10.3772/j.issn.1002-0470.2010.04.004
2010-01-01
Abstract: This paper presents an unsupervised translation disambiguation method based on mining the Web relatedness of bilingual words. It aims to address the problems of knowledge acquisition and data sparseness in translation disambiguation. The paper first introduces an indirect association model of bilingual words, then extends that model to bilingual Web pages, and finally derives a bilingual Web relatedness measure centered on Web pages. Relatedness between words is computed as point-wise mutual information, and disambiguation is performed by constructing different queries and extracting Web page counts from a search engine. The method outperforms TorMd, the best unsupervised system on Semeval-2007 Task #5, and achieves state-of-the-art results (Pmar = 0.464).
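The core computation described in the abstract, point-wise mutual information estimated from Web page counts and used to pick a translation, might look roughly like the sketch below. This is an illustration under stated assumptions, not the authors' implementation: the page counts, the total page number, the pinyin context token, and the function names (`hits`, `pmi`, `disambiguate`) are all hypothetical, and a real system would obtain counts by sending queries to a search engine.

```python
import math

# Hypothetical values standing in for search-engine page counts.
# All numbers are invented for illustration only.
TOTAL_PAGES = 1_000_000
PAGE_COUNTS = {
    frozenset(["cunkuan"]): 60_000,          # source-language context word ("deposit", in pinyin)
    frozenset(["bank"]): 40_000,             # candidate translation 1
    frozenset(["shore"]): 5_000,             # candidate translation 2
    frozenset(["cunkuan", "bank"]): 9_000,   # pages containing both terms
    frozenset(["cunkuan", "shore"]): 50,
}


def hits(*terms):
    """Return the (hypothetical) number of pages containing all given terms."""
    return PAGE_COUNTS.get(frozenset(terms), 0)


def pmi(x, y):
    """Point-wise mutual information of x and y estimated from page counts."""
    joint, count_x, count_y = hits(x, y), hits(x), hits(y)
    if joint == 0 or count_x == 0 or count_y == 0:
        return float("-inf")  # no co-occurrence evidence at all
    return math.log2(joint * TOTAL_PAGES / (count_x * count_y))


def disambiguate(context_words, candidates):
    """Pick the candidate translation most related to the context words."""
    return max(candidates, key=lambda c: sum(pmi(w, c) for w in context_words))


print(disambiguate(["cunkuan"], ["bank", "shore"]))
```

With these invented counts the context word co-occurs far more often with "bank" than chance would predict and far less often with "shore", so "bank" is selected. Summing PMI over context words is one simple way to combine evidence; the abstract does not specify how the paper combines it.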