Optimizing Feature Encoding for Self-Organizing Chinese Semantic Maps

Min Zhang, Cst Dep
2003-01-01
Abstract: In this paper, we introduce the self-organizing Chinese semantic map, then study and propose six approaches to feature encoding, which is crucial to the performance of a self-organizing map (SOM). The approaches are based on set theory, algebra, and probability theory, respectively. The evaluation results show that combining the frequency density approach with the TFIDF approach performs best, with 94.4% precision and 90.7% recall on semantic mapping, and that vector space oriented approaches are not suitable for the task. Analyses of the results are also given. Comparative experiments show that the best approach in this paper outperforms the conventional hierarchical clustering technique, and performs much better than feature encoding based on dimension reduction by multivariate statistical analyses such as principal component analysis.
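To make the TFIDF side of the feature encoding concrete, the following is a minimal sketch of how context counts for each word could be turned into TFIDF-weighted input vectors for a SOM. It is an illustration under stated assumptions, not the encoding used in the paper: the function name `tfidf_encode`, the input layout (word to context-feature counts), and the plain `log(N / df)` form of the inverse document frequency are all assumptions, and the frequency density component is not shown because the abstract does not define it.

```python
import math


def tfidf_encode(counts):
    """Encode each word as a TFIDF-weighted vector over context features.

    counts: dict mapping word -> dict mapping context feature -> raw count.
    Returns (features, vectors), where features is the sorted feature list
    and vectors maps each word to a list aligned with features.
    This layout is a hypothetical choice made for illustration only.
    """
    n_words = len(counts)

    # Document frequency: in how many words' contexts each feature occurs.
    df = {}
    for feats in counts.values():
        for feature, count in feats.items():
            if count > 0:
                df[feature] = df.get(feature, 0) + 1

    features = sorted(df)
    vectors = {}
    for word, feats in counts.items():
        total = sum(feats.values())
        vec = []
        for feature in features:
            # Term frequency, normalized by the word's total context count.
            tf = feats.get(feature, 0) / total if total else 0.0
            # Inverse document frequency (assumed plain log form).
            idf = math.log(n_words / df[feature])
            vec.append(tf * idf)
        vectors[word] = vec
    return features, vectors


# Toy input with placeholder words and features (not data from the paper).
toy_counts = {
    "word_a": {"ctx_x": 2, "ctx_y": 2},
    "word_b": {"ctx_x": 1},
}
toy_features, toy_vectors = tfidf_encode(toy_counts)
```

In this toy input, `ctx_x` occurs in the context of every word, so its weight is zero for all of them, while `ctx_y` keeps a positive weight for `word_a`. Features that discriminate between words therefore dominate the vectors presented to the map.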