Efficient building algorithms of decision tree for uniformly distributed uncertain data.

Chenggang Li, Liping Huang, Ling Tian
DOI: https://doi.org/10.1109/ICNC.2011.6022055
2011-01-01
Abstract: Developing algorithms for uncertain data is one of the most active themes in the data mining community. A number of decision tree classifiers have been studied for handling uncertain data, and this paper extends that work. We develop a tree-pruning algorithm that uses the sum of tuple fractions, based on probability theory. Pruning improves both the accuracy of the classifier and the efficiency of building the decision tree. We also find that, under a uniform distribution, increasing the sampling density of the uncertain attribute values contributes little to accuracy while being computationally more costly. We therefore propose a new sampling method, which greatly reduces the execution time of building the decision tree. © 2011 IEEE.
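To make the notion of tuple fractions concrete, the following is a minimal sketch of how a tuple whose attribute value is uniformly distributed over an interval might be divided fractionally at a split point, and how the sums of those fractions feed an entropy calculation. It is an illustration only, not the paper's pruning or sampling algorithm: the function names (`left_fraction`, `entropy`, `split_entropy`) and the `(low, high, label)` tuple layout are assumptions introduced here.

```python
import math


def left_fraction(low, high, split):
    """Probability mass of Uniform[low, high] at or below `split` (hypothetical helper)."""
    if high <= low:
        # Degenerate interval: treat the value as a single certain point.
        return 1.0 if low <= split else 0.0
    return min(1.0, max(0.0, (split - low) / (high - low)))


def entropy(weights):
    """Shannon entropy (in bits) of a dict mapping class label -> fractional count."""
    total = sum(weights.values())
    if total <= 0:
        return 0.0
    return -sum((w / total) * math.log2(w / total)
                for w in weights.values() if w > 0)


def split_entropy(tuples, split):
    """Weighted entropy after splitting fractional tuples at `split`.

    `tuples` is a list of (low, high, label); each tuple contributes a
    fraction of itself to the left branch and the remainder to the right.
    """
    left, right = {}, {}
    for low, high, label in tuples:
        f = left_fraction(low, high, split)
        left[label] = left.get(label, 0.0) + f
        right[label] = right.get(label, 0.0) + (1.0 - f)
    n_left, n_right = sum(left.values()), sum(right.values())
    n = n_left + n_right
    return (n_left / n) * entropy(left) + (n_right / n) * entropy(right)
```

In this sketch the fraction sent to each branch has a closed form under the uniform assumption, so no sample points inside the interval are needed to evaluate a candidate split. For example, `split_entropy([(0, 2, "a"), (8, 10, "b")], 5)` separates the two classes completely and yields an entropy of zero.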