Comparison of Data Standardization Method in Semantic Relation Similarity Calculation

WANG Zheng-peng,XIE Zhi-peng,QIU Pei-chao
DOI: https://doi.org/10.3969/j.issn.1000-3428.2012.10.010
2012-01-01
Abstract:This paper researches the influence of the data standardization for semantic relation similarity calculation.It extracts lexical pattern from huge text corpus,generates the word pair-lexical pattern matrix,employs three methods to standard the original data matrix,and uses law study method to calculate the similarity between relations.Experimental result shows that without any standardization,the classification task with a statistically significant average precision score is 0.87,z-score standardization is 0.89,interval standardization is 0.95,and weighted based on entropy is 0.96.
What problem does this paper attempt to address?