Bilingual Topic Taxonomy Generation Based on Bilingual Documents Clustering

Cheng-Zhi Zhang
DOI: https://doi.org/10.1109/icmlc.2011.6016948
2011-01-01
Abstract:Bilingual taxonomy is one of key components of multilingual Ontology. In this paper, affinity propagation clustering algorithm is used to cluster bilingual documents collection and generate bilingual topic taxonomy. Two bilingual topic taxonomy generation methods, i.e. bilingual documents clustering before or after text feature reconstruction, are described. Dataset in two domains are tested and result shows that: according to net similarity, the result of documents clustering after feature reconstruction is better than that before feature reconstruction.
What problem does this paper attempt to address?