Abstract: Classification can be regarded as dividing the data space into decision regions separated by decision boundaries. In this paper, we analyze decision tree algorithms and the NBTree algorithm from this perspective. A decision tree can then be regarded as a classifier tree in which each classifier on a non-root node is trained in a decision region of the classifier on its parent node. Likewise, the NBTree algorithm, which generates a classifier tree with the C4.5 algorithm as the root classifier and naive Bayes classifiers as the leaf classifiers, can be regarded as training naive Bayes classifiers in the decision regions of the C4.5 algorithm. We propose a second division (SD) algorithm and three soft second division (SD-soft) algorithms that train classifiers in the decision regions of the naive Bayes classifier. All four algorithms generate two-level classifier trees with the naive Bayes classifier as the root classifier. They make good use of both the information contained in instances near decision boundaries and the information that the naive Bayes classifier may ignore. Finally, we conduct experiments on 30 data sets from the UC Irvine (UCI) repository. Experimental results show that the SD algorithm achieves better generalization ability than the NBTree and averaged one-dependence estimators (AODE) algorithms when the C4.5 algorithm and the support vector machine (SVM) are used as leaf classifiers. Further experiments indicate that the three SD-soft algorithms can achieve better generalization ability than the SD algorithm when their parameter values are selected appropriately.
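
The two-level classifier tree described in the abstract can be illustrated with a minimal sketch. It rests on several assumptions that the abstract does not confirm: a decision region is taken to be the set of training instances to which the root naive Bayes classifier assigns the same class, the root is a categorical naive Bayes with Laplace smoothing, and a one-feature decision stump stands in for the C4.5 and SVM leaf classifiers used in the paper. The class names `NaiveBayes`, `Stump`, and `SecondDivision` and the toy data are hypothetical, and the SD-soft variants are not covered.

```python
from collections import Counter, defaultdict
import math


class NaiveBayes:
    """Categorical naive Bayes with Laplace smoothing (root classifier)."""

    def fit(self, X, y):
        self.classes = sorted(set(y))
        self.n = len(y)
        self.class_count = Counter(y)
        self.feat_count = Counter()  # (feature index, value, class) -> count
        self.values = [set() for _ in X[0]]
        for row, label in zip(X, y):
            for j, v in enumerate(row):
                self.feat_count[(j, v, label)] += 1
                self.values[j].add(v)
        return self

    def predict_one(self, row):
        def log_posterior(c):
            lp = math.log((self.class_count[c] + 1) / (self.n + len(self.classes)))
            for j, v in enumerate(row):
                lp += math.log((self.feat_count[(j, v, c)] + 1)
                               / (self.class_count[c] + len(self.values[j])))
            return lp
        return max(self.classes, key=log_posterior)


class Stump:
    """One-feature decision stump; a simple stand-in for C4.5 or SVM leaves."""

    def fit(self, X, y):
        self.default = Counter(y).most_common(1)[0][0]
        best = None
        for j in range(len(X[0])):
            by_value = defaultdict(Counter)
            for row, label in zip(X, y):
                by_value[row[j]][label] += 1
            correct = sum(c.most_common(1)[0][1] for c in by_value.values())
            if best is None or correct > best[0]:
                table = {v: c.most_common(1)[0][0] for v, c in by_value.items()}
                best = (correct, j, table)
        _, self.feature, self.table = best
        return self

    def predict_one(self, row):
        return self.table.get(row[self.feature], self.default)


class SecondDivision:
    """Two-level tree: naive Bayes root, one leaf classifier per decision region."""

    def fit(self, X, y):
        self.root = NaiveBayes().fit(X, y)
        regions = defaultdict(lambda: ([], []))
        for row, label in zip(X, y):
            rows, labels = regions[self.root.predict_one(row)]
            rows.append(row)
            labels.append(label)
        self.leaves = {r: Stump().fit(rows, labels)
                       for r, (rows, labels) in regions.items()}
        return self

    def predict_one(self, row):
        region = self.root.predict_one(row)
        leaf = self.leaves.get(region)
        # Fall back to the root's prediction if no training data fell in this region.
        return leaf.predict_one(row) if leaf is not None else region


def accuracy(model, X, y):
    return sum(model.predict_one(r) == t for r, t in zip(X, y)) / len(y)


# Toy XOR-like data with unbalanced counts (hypothetical, for illustration only).
X = [(0, 0), (0, 0), (0, 0), (0, 1), (1, 0), (1, 1)]
y = [0, 0, 0, 1, 1, 0]

nb_acc = accuracy(NaiveBayes().fit(X, y), X, y)
sd_acc = accuracy(SecondDivision().fit(X, y), X, y)
print(f"naive Bayes training accuracy: {nb_acc:.3f}")
print(f"second-division training accuracy: {sd_acc:.3f}")
```

Because the root predicts a single class throughout each region, any leaf that does at least as well as the majority class of its region cannot lower training accuracy in this sketch. Generalization on unseen data is a separate question, and it is the one the paper's experiments address.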

A Parallelized Semi-Supervised Naïve Bayes Classifier

A Bayesian Network nearest k-labels method for Multi-label classification

A Technique For Improving The Performance Of Naive Bayes Text Classification

Parallel naive Bayes algorithm for large-scale Chinese text classification based on Spark

Evolutionary Lazy Learning for Naive Bayes Classification

Dual-Classifier Collaborative Method Based on Semi-Supervised Active Learning

Efficient heuristics for learning scalable Bayesian network classifier from labeled and unlabeled data

Semi-supervised Dynamic Counter Propagation Network

SNNB: A Selective Neighborhood Based Naïve Bayes for Lazy Learning

Semi-supervised Learning for K-Dependence Bayesian Classifiers

The Study of Naive Bayes Algorithm Online in Data Mining

A Parallel Varied Density-Based Clustering Algorithm with Optimized Data Partition

A Hybrid Generative/discriminative Method for Semi-Supervised Classification

Joint Semi-Supervised Feature Selection and Classification Through Bayesian Approach

Enhancing SNNB with Local Accuracy Estimation and Ensemble Techniques

Efficient Heuristics for Learning Bayesian Network from Labeled and Unlabeled Data

BCCNet: Bayesian classifier combination neural network

Improving Naive Bayes Classifier by Dividing Its Decision Regions

Learning Balanced Bayesian Classifiers from Labeled and Unlabeled Data

Acceleration of Naive-Bayes algorithm on multicore processor for massive text classification

An efficient multi-relational Naïve Bayesian classifier based on semantic relationship graph