A Novel Approach to Naive Bayes Web Page Automatic Classification

Zhongli He, Zhijing Liu
DOI: https://doi.org/10.1109/FSKD.2008.284
2008-01-01
Abstract: In this paper, a novel approach to web page classification is proposed that uses a Naive Bayes (NB) classifier based on Independent Component Analysis (ICA). To perform the classification, a web page is first represented by a vector of features with different weights, and the method for calculating the weights is improved. Because the number of features is large, Principal Component Analysis (PCA) is applied in a preprocessing step to select the relevant features, and its output serves as input to an improved ICA algorithm (MFICA). Finally, the output of MFICA is sent to the NB classifier to boost its classification performance. The experimental evaluation demonstrates that the NB classifier based on the ICA model provides acceptable classification accuracy.
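The pipeline described in the abstract (weighted feature vectors, then PCA, then ICA, then NB) can be roughly sketched as follows. This is only an illustration under several assumptions, not the authors' implementation: the abstract does not specify the improved weighting scheme or MFICA, so plain TF-IDF and scikit-learn's standard FastICA stand in for them. The toy corpus, the labels, the function name classify_pages, and the component counts are all hypothetical.

```python
import numpy as np
from sklearn.decomposition import PCA, FastICA
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.naive_bayes import GaussianNB

# Hypothetical toy corpus standing in for text extracted from web pages.
pages = [
    "football match score goal league",
    "tennis player wins match tournament",
    "basketball team score season playoff",
    "league season goal striker transfer",
    "stock market shares price investors",
    "bank interest rates market economy",
    "investors buy shares company profit",
    "economy growth price inflation bank",
]
labels = np.array([0, 0, 0, 0, 1, 1, 1, 1])  # 0 = sports, 1 = finance


def classify_pages(train_texts, train_labels, test_texts, n_pca=4, n_ica=3):
    """Sketch of a TF-IDF -> PCA -> ICA -> Naive Bayes pipeline."""
    # 1. Weighted feature vectors. Assumption: standard TF-IDF weights,
    #    not the improved weighting mentioned in the abstract.
    vectorizer = TfidfVectorizer()
    x_train = vectorizer.fit_transform(train_texts).toarray()
    x_test = vectorizer.transform(test_texts).toarray()

    # 2. PCA as a preprocessing step to reduce the feature dimension.
    pca = PCA(n_components=n_pca, random_state=0)
    x_train = pca.fit_transform(x_train)
    x_test = pca.transform(x_test)

    # 3. ICA on the PCA output. Assumption: standard FastICA, not MFICA.
    ica = FastICA(n_components=n_ica, random_state=0, max_iter=1000)
    x_train = ica.fit_transform(x_train)
    x_test = ica.transform(x_test)

    # 4. Gaussian Naive Bayes on the independent components
    #    (the components are real-valued, so a Gaussian model is used).
    clf = GaussianNB()
    clf.fit(x_train, train_labels)
    return clf.predict(x_test)


predictions = classify_pages(
    pages, labels, ["team wins league match", "bank shares price"]
)
print(predictions)
```

ICA is a plausible fit for this setup because Naive Bayes assumes that features are conditionally independent, and ICA outputs components that are as statistically independent as possible. Whether that improves accuracy on real web page data would need to be checked experimentally.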