Decision Tree for Uncertain Data Based on Reachable Probability Intervals

CHEN Hongmei, WANG Lizhen, LIU Weiyi, YUAN Lijian
DOI: https://doi.org/10.3778/j.issn.1673-9418.2012.08.006
2012-01-01
Abstract: Because probability distributions over uncertain data are often difficult to obtain in applications, this paper studies a decision tree for discrete objects whose values are uncertain and whose probabilities are missing. First, the paper defines (conditional) probability intervals and proves that they are reachable probability intervals. Second, based on the reachable probability intervals, it defines (conditional) entropy intervals and gives a method for computing their upper and lower bounds. Finally, it presents a new decision tree for uncertain data, in which conditional entropy intervals are used to select the best attributes and objects are assigned to branches with probability intervals. The decision tree can handle both value-uncertain discrete objects with missing probabilities and certain discrete objects. Experiments on uncertain datasets derived from UCI datasets show satisfactory performance.
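To make the idea of probability and entropy intervals concrete, the following is a minimal sketch and not the paper's algorithm. It assumes each uncertain object is given as a set of candidate values with no probabilities attached, takes the lower bound for a value as the fraction of objects that can only take that value, and takes the upper bound as the fraction that can possibly take it. The entropy bounds are then computed over all distributions that fit inside those intervals, which is a relaxation chosen here for illustration. All function names are hypothetical.

```python
from itertools import product
from math import log2


def probability_intervals(objects, values):
    """Map each value to (lower, upper) bounds on its relative frequency.

    Assumption: `objects` is a list of sets of candidate values.
    """
    n = len(objects)
    return {
        v: (sum(o == {v} for o in objects) / n,  # objects that must be v
            sum(v in o for o in objects) / n)    # objects that may be v
        for v in values
    }


def entropy(p):
    """Shannon entropy in bits, treating 0 * log(0) as 0."""
    return -sum(x * log2(x) for x in p if x > 0)


def max_entropy(lows, highs, iters=100):
    """Largest entropy with lows <= p <= highs and sum(p) == 1.

    The maximizer has the form p_i = clamp(t, low_i, high_i);
    bisection finds the level t at which the clamped values sum to 1.
    """
    lo, hi = 0.0, 1.0
    for _ in range(iters):
        t = (lo + hi) / 2
        if sum(min(max(t, l), u) for l, u in zip(lows, highs)) < 1:
            lo = t
        else:
            hi = t
    return entropy([min(max(hi, l), u) for l, u in zip(lows, highs)])


def min_entropy(lows, highs, tol=1e-12):
    """Smallest entropy over the same region.

    Entropy is concave, so the minimum lies at a vertex: every coordinate
    but one sits at a bound. Enumeration is exponential in the number of
    values, so this is only suitable for small domains.
    """
    k = len(lows)
    best = None
    for j in range(k):
        others = [i for i in range(k) if i != j]
        for choice in product(*[(lows[i], highs[i]) for i in others]):
            pj = 1.0 - sum(choice)  # the one free coordinate
            if lows[j] - tol <= pj <= highs[j] + tol:
                h = entropy(list(choice) + [pj])
                best = h if best is None else min(best, h)
    return best


def entropy_interval(intervals):
    """Return (lower, upper) entropy bounds for a dict of intervals."""
    lows = [l for l, _ in intervals.values()]
    highs = [u for _, u in intervals.values()]
    return min_entropy(lows, highs), max_entropy(lows, highs)


if __name__ == "__main__":
    # Three certain objects and one that may be either "a" or "b".
    objs = [{"a"}, {"a"}, {"b"}, {"a", "b"}]
    iv = probability_intervals(objs, ["a", "b"])
    print(iv, entropy_interval(iv))
```

In this small example both ends of the entropy interval are attained by actually assigning the uncertain object to "a" or to "b". In general, the bounds from this sketch may be looser than bounds restricted to assignments the data can realize.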