Community question topic categorization via hierarchical kernelized classification.

Wen Chan,Weidong Yang,Jinhui Tang,Jintao Du,Xiangdong Zhou,Wei Wang
DOI: https://doi.org/10.1145/2505515.2505676
2013-01-01
Abstract:We present a hierarchical kernelized classification model for the automatic classification of general questions into their corresponding topic categories in community Question Answering service (cQAs). This could save many efforts of manual classification and facilitate browsing as well as better retrieving of questions from the cQA archives. To deal with the challenge of short text message of questions, we explore and optimally combine various cQA features by introducing multiple kernel learning strategy into the hierarchical classification framework. We propose a hybrid regularization approach of combining orthogonal constraint and L1 sparseness in our framework to promote the discriminative power on similar topics as well as sparsing the model parameters. The experimental results on a real world dataset from Yahoo! Answers demonstrate the effectiveness of our proposed model as compared to the state-of-the-art methods and strong baselines.
What problem does this paper attempt to address?