Abstract:Classification can be regarded as dividing the data space into decision regions separated by decision boundaries. In this paper we analyze decision tree algorithms and the NBTree algorithm from this perspective. Thus, a decision tree can be regarded as a classifier tree, in which each classifier on a non-root node is trained in decision regions of the classifier on the parent node. Meanwhile, the NBTree algorithm, which generates a classifier tree with the C4.5 algorithm and the naive Bayes classifier as the root and leaf classifiers respectively, can also be regarded as training naive Bayes classifiers in decision regions of the C4.5 algorithm. We propose a second division (SD) algorithm and three soft second division (SD-soft) algorithms to train classifiers in decision regions of the naive Bayes classifier. These four novel algorithms all generate two-level classifier trees with the naive Bayes classifier as root classifiers. The SD and three SD-soft algorithms can make good use of both the information contained in instances near decision boundaries, and those that may be ignored by the naive Bayes classifier. Finally, we conduct experiments on 30 data sets from the UC Irvine (UCI) repository. Experiment results show that the SD algorithm can obtain better generalization abilities than the NBTree and the averaged one-dependence estimators (AODE) algorithms when using the C4.5 algorithm and support vector machine (SVM) as leaf classifiers. Further experiments indicate that our three SD-soft algorithms can achieve better generalization abilities than the SD algorithm when argument values are selected appropriately.

Boai: Fast Alternating Decision Tree Induction Based On Bottom-Up Evaluation

FROD: an Efficient Framework for Optimizing Decision Trees in Packet Classification

Speeding Up Very Fast Decision Tree with Low Computational Cost

Branches: A Fast Dynamic Programming and Branch & Bound Algorithm for Optimal Decision Trees

Extremely Fast Decision Tree

bsnsing: A decision tree induction method based on recursive optimal boolean rule composition

TreeBoN: Enhancing Inference-Time Alignment with Speculative Tree-Search and Best-of-N Sampling

A Communication-Efficient Parallel Algorithm for Decision Tree

A New Method for Learning Decision Tree Classifier

Boosting Trees for Cost-Sensitive Classifications.

Dive into Decision Trees and Forests: A Theoretical Demonstration

Improving Naive Bayes Classifier by Dividing Its Decision Regions

Visual Diagnosis of Tree Boosting Methods

Efficient Tree Classifiers for Large Scale Datasets.

Strong Optimal Classification Trees

An Improvement On The Algorithm Of Decision Tree

Quant-BnB: A Scalable Branch-and-Bound Method for Optimal Decision Trees with Continuous Features

Taiga: Performance Optimization of the C4.5 Decision Tree Construction Algorithm

Saving Evaluation Time for the Decision Function in Boosting: Representation and Reordering Base Learner.

Averaged Tree-Augmented One-Dependence Estimators