An Automatic Algorithm of Text Categorization Imitating Human's

王树梅,戴保存,黄河燕,陈肇雄
DOI: https://doi.org/10.3969/j.issn.1002-137X.2003.03.011
2003-01-01
Computer Science
Abstract:An algorithm of text classification is given that imitates human's in this paper. On one hand, the algorithmenhances weight of theme when feature vector is processed, because of the assumption that the title of a document canproject its content. On the other hand,a weight parameter o vector is designed to simulate human's skimming andskipping behavior for calculating method of a document cluster center, and a weight of the feature that there are morepositive examples than negative ones is enhanced . The experiment shows that the algorithm greatly improves the per-formance of a text classification system.
What problem does this paper attempt to address?