OPINAX : An Effective Product Attribute Mining System

HAO Bo-yi,XIA Yun-qing,ZHENG Fang
2008-01-01
Abstract:HAO Bo-yi, XIA Yun-qing, ZHENG Fang State Key Lab of Intelligent Technology and Systems Tsinghua University, Beijing 100084 E-mail: haoby@cslt.riit.tsinghua.edu.cn Center for Speech and Language Technologies, RIIT, Tsinghua University, Beijing 100084, China E-mail: {yqxia, fzheng}@tsinghua.edu.cn Abstract: As a major task of the product opinion mining system, product attribute extraction influences performance of the system significantly. To find the out-of-the-vocabulary (OOV) product attributes, an effective attribute mining algorithm is proposed based on language dependency parsing and corpus statistical analysis. Based on a small set of standard product attributes, this algorithm applies the dependency parsing tool on review text to find the potential OOV product attributes. Then statistical features extracted from both the dependency parsing results and text content are used to filter out the invalid OOV product attributes and rank the attribute by measuring the confidence. Experiments are conducted to evaluate precision of OOV attribute extraction and effectiveness of the ranking method. Moreover, contribution of various features is also evaluated. Experiment results show that precision of OOV attribute extraction reaches 87.5% in the top 200 candidates.
What problem does this paper attempt to address?