Research on Lucene-based English-Chinese Cross-Language Information Retrieval

Yuejie Zhang,Tao Zhang,Shijie Chen
2005-01-01
Abstract:In this paper, we present our English-Chinese Cross-Language Information Retrieval (CLIR) system. We focus our attention on finding effective translation equivalents between English and Chinese, and improving the performance of Chinese IR. On English-Chinese CLIR, we adopt query translation as the dominant strategy, and utilize English-Chinese bilingual dictionary as the important knowledge resource to acquire correct translations. On Chinese monolingual retrieval, we investigated the use of different entities as indexes and implement our retrieval system based on the Lucene toolkit. On system evaluation, we present an effective method to generate the sets of relevant documents for query topics.
What problem does this paper attempt to address?