Feature selection using syntactic and semantic information in question classification

YUAN Xiao-jie,SHI Jian-xing,NING Hua,YU Shi-tao
DOI: https://doi.org/10.3778/j.issn.1002-8331.2008.33.045
2008-01-01
Abstract:Question classification is a very important sub-module of question answering system,and the key lies in the feature selection.This paper proposes a new feature selection method based on syntactic and semantic information,using the question word,the main verb of the question,the dependency structure,the main noun and the top hypernym of the noun as features for classification.Evaluate the effect of feature selection using KNN and Nave Bayes classifiers,and attain an expected result.In the predefined question taxonomy,the classification accurate reaches 82.2% and 83.7% respectively.It is better than the method using bag-of-words features.
What problem does this paper attempt to address?