Analyzing the Classifier using the Knowledge Hierarchy

Yan Zhiyong, Xu Congfu
2011-01-01
Abstract: In this paper, we analyze classifiers in machine learning using the knowledge hierarchy from information science. An instance provides the type of information that answers questions beginning with "where" and "what". A classifier is the type of knowledge that answers questions beginning with "how", and a classification model is the type of knowledge that answers questions beginning with "why". A classifier uses the information contained in a training data set to obtain the knowledge represented by a classification model. We analyze the types of information used by four typical classifiers: the C4.5 algorithm, the Naive Bayes classifier, the support vector machine, and k nearest neighbors. We also analyze typical methods for improving these four classifiers from the perspective of the information they use, and we divide information into global information and local information according to the number of instances and attributes used.