Short-text Classification Method Based on Concept Network

LIN Xiao-jun,ZHANG Meng,BAO Xiao,LI Jun,WU Xi-hong
DOI: https://doi.org/10.3969/j.issn.1000-3428.2010.21.002
2010-01-01
Abstract:Aiming at the short-text classification in archive domain,this paper designs an automatic classification method based on concept network.It constructs domain ontology by analyzing the short-text language characteristic in domain,and converts the short-text of title to structural concept network which expresses through Resource Description Framework(RDF) by means of natural language processing technology.On that basis,it defines a similarity measure for archives to classify the retention period of archives.Experimental results show that this method gets a relative 24.2% decrease in classification error rate,and it improves the system performance compared with traditional short-text classification method based on characteristic selection.
What problem does this paper attempt to address?