An Attribute Discretization Algorithm Based on Rough Set and Information Entropy

He Liu, Da-You Liu, Xiao-Hu Shi, Ying Gao
DOI: https://doi.org/10.1109/icmlc.2008.4620405
2008-01-01
Abstract: Attribute discretization is one of the key issues in Rough Set theory. First, a method is proposed to compute an initial set of cut points that preserves the indiscernibility relation of the decision table while reducing the number of cut points in the initial set. Then, the cut point information entropy is defined to measure the importance of a cut point. Finally, an attribute discretization algorithm based on Rough Set theory and information entropy is proposed. The algorithm preserves the consistency of the decision table and handles mixed decision tables, which contain both continuous and discrete attributes. The experimental results show that the algorithm is effective and can process large-scale datasets.
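The two building blocks named in the abstract, a reduced initial cut point set and an entropy-based importance score, can be illustrated with a minimal sketch. The abstract does not give the authors' definitions, so everything here is an assumption: `candidate_cuts` keeps only midpoints between adjacent attribute values whose decision labels differ, and `cut_entropy` uses the standard class-information entropy of the two-way partition induced by a cut (lower means the cut separates the classes better). The function names are hypothetical, and this is not the paper's algorithm.

```python
import math
from collections import Counter


def candidate_cuts(values, labels):
    """Midpoints between adjacent distinct values, skipping pairs that
    both carry the same single decision label (assumed reduction rule)."""
    by_value = {}
    for v, d in zip(values, labels):
        by_value.setdefault(v, set()).add(d)
    distinct = sorted(by_value)
    cuts = []
    for lo, hi in zip(distinct, distinct[1:]):
        same_pure_label = by_value[lo] == by_value[hi] and len(by_value[lo]) == 1
        if not same_pure_label:
            cuts.append((lo + hi) / 2)
    return cuts


def entropy(labels):
    """Shannon entropy (base 2) of a list of decision labels."""
    n = len(labels)
    if n == 0:
        return 0.0
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())


def cut_entropy(values, labels, cut):
    """Weighted entropy of the decision labels on each side of the cut
    (an assumed stand-in for the paper's cut point information entropy)."""
    left = [d for v, d in zip(values, labels) if v < cut]
    right = [d for v, d in zip(values, labels) if v >= cut]
    n = len(labels)
    return len(left) / n * entropy(left) + len(right) / n * entropy(right)


# Toy continuous attribute with a binary decision attribute.
values = [1.0, 2.0, 3.0, 4.0]
labels = ["a", "a", "b", "b"]
cuts = candidate_cuts(values, labels)
best = min(cuts, key=lambda c: cut_entropy(values, labels, c))
```

In this toy table only the midpoint between 2.0 and 3.0 survives as a candidate, because the other adjacent pairs share a single decision label and splitting them would not help distinguish any objects.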