An kNN Algorithm Based on Vector Angle for Multi-label Text Categorization

Kai GUANG,Jin-Gui PAN
DOI: https://doi.org/10.3969/j.issn.1002-137X.2008.04.061
2008-01-01
Computer Science
Abstract:In multi-label learning, each instance in the training set is associated with a set of labels, and the task is to output a label set whose size is unknown a priori for each unseen instance. k nearest neighbors (kNN) algorithm is recently applied to multi-label categorization. In detail, each instance is transformed into a vector and the label vector of the test instance is determined by its k nearest neighbors, which are chosen by the Euclidean distance of a couple of vectors. In this paper, a multi-label lazy learning approach named θ -MLkNN is presented, which is derived from the traditional k nearest neighbor (kNN) algorithm. Instead, we select the k nearest neighbors by the angle of two vectors. Experiments on a real-world text data set show that θ -MLkNN achieves better precision to traditional MLkNN algorithms.
What problem does this paper attempt to address?