English-Chinese Bi-Directional OOV Translation based on Web Mining and Supervised Learning.

Yuejie Zhang,Yang Wang,Xiangyang Xue
DOI: https://doi.org/10.3115/1667583.1667624
2009-01-01
Abstract:In Cross-Language Information Retrieval (CLIR), Out-of-Vocabulary (OOV) detection and translation pair relevance evaluation still remain as key problems. In this paper, an English-Chinese Bi-Directional OOV translation model is presented, which utilizes Web mining as the corpus source to collect translation pairs and combines supervised learning to evaluate their association degree. The experimental results show that the proposed model can successfully filter the most possible translation candidate with the lower computational cost, and improve the OOV translation ranking effect, especially for popular new words.
What problem does this paper attempt to address?