A New Method for Learning Decision Tree Classifier

Zulkarnain Saurav, D. Farid, Sadot Arefin, Maliha Mehjabin Mitu, Md. Afanur Hasan, Nigar Sultana Ritu
DOI: https://doi.org/10.1109/ECCE57851.2023.10101557
2023-02-23
Abstract: Decision Tree (DT) induction is one of the popular data modelling techniques that is commonly used in many real-world supervised learning problems. DT is a top-down recursive divide-and-conquer algorithm. It divides the dataset until all the instances of a sub-dataset belong to the same class value. DT has several advantages, e.g. it is easy to implement and easy to understand, it requires little prior knowledge, and non-linear relationships do not affect the tree performance. DT can be used for both classification and regression tasks. The traditional DT methods use information gain, gain ratio, and Gini values to select the root/best node. DT can be scaled to deal with Big Data, e.g. with the RainForest and BOAT (Optimistic Decision Tree Construction) algorithms. In this paper, we propose a new method for decision tree construction where, instead of using the traditional informative feature selection technique, we consider each of the input features to build a decision tree and use these trees as an ensemble classifier. Each of the trees predicts the test data, and the ensemble returns the majority vote. We have compared the performance of the proposed method with the well-known C4.5 classifier using datasets taken from the UC Irvine Machine Learning Repository. The proposed method outperforms the C4.5 classifier by almost 30%.
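The ensemble idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify how each per-feature tree is grown, so the sketch assumes the simplest reading, namely one single-level tree per categorical input feature that maps each feature value to the majority class seen in training. The function names (`fit_single_feature_tree`, `fit_ensemble`, `predict`) and the toy weather data are hypothetical and chosen only for illustration.

```python
from collections import Counter, defaultdict

def fit_single_feature_tree(rows, labels, feature_index):
    """Assumed simplification: a one-level tree on one categorical feature.
    Each observed feature value becomes a branch labelled with its majority class."""
    counts_by_value = defaultdict(Counter)
    for row, label in zip(rows, labels):
        counts_by_value[row[feature_index]][label] += 1
    # Fallback for feature values never seen in training: overall majority class.
    default = Counter(labels).most_common(1)[0][0]
    branches = {value: counts.most_common(1)[0][0]
                for value, counts in counts_by_value.items()}
    return feature_index, branches, default

def fit_ensemble(rows, labels):
    """Build one tree per input feature, with no feature selection step."""
    return [fit_single_feature_tree(rows, labels, j) for j in range(len(rows[0]))]

def predict(ensemble, row):
    """Every tree votes on the instance; the majority class is returned."""
    votes = Counter(branches.get(row[j], default)
                    for j, branches, default in ensemble)
    return votes.most_common(1)[0][0]

# Hypothetical toy data: (outlook, temperature, humidity) -> play
rows = [
    ("sunny", "hot", "high"),
    ("sunny", "mild", "high"),
    ("rainy", "cool", "normal"),
    ("rainy", "cool", "high"),
    ("overcast", "hot", "normal"),
    ("overcast", "cool", "normal"),
    ("sunny", "hot", "normal"),
]
labels = ["no", "no", "yes", "yes", "yes", "yes", "no"]

ensemble = fit_ensemble(rows, labels)
print(predict(ensemble, ("sunny", "cool", "normal")))
print(predict(ensemble, ("rainy", "hot", "high")))
```

In this sketch no feature is ranked by information gain, gain ratio, or Gini value; every feature contributes one vote. A fuller version would presumably grow each tree beyond a single level and handle numeric features, details the abstract leaves open.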
Computer Science