Data Mining in Incomplete Information

赵卫东,盛昭瀚,李旗号
DOI: https://doi.org/10.3969/j.issn.1005-2542.2001.02.012
2001-01-01
Abstract: Data mining in incomplete information systems is difficult but unavoidable when decisions must be made under uncertainty. Besides null-valued and layered-valued attributes, interval-valued attributes are also treated as a form of incomplete information. Extension matrix and rough set theory are used here to deal with the incompleteness. First, the selection of interval-valued attributes is proved to be an NP-hard problem, and a heuristic algorithm based on minimal partition points is proposed. Then a probability tree and a fuzzy tree are discussed in detail as ways to improve on earlier decision tree algorithms, such as ID3 and C4.5, which are susceptible to noise.
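To make the idea of choosing partition points for an interval-valued attribute more concrete, the following is a minimal illustrative sketch. It is not the algorithm from the paper, whose details the abstract does not give. It assumes each object carries one interval `(low, high)` and a decision label, takes the midpoints between adjacent interval endpoints as candidate partition points, and greedily picks the point that separates the most still-unseparated pairs of objects with different decisions. The function names `candidate_cuts`, `separates`, and `greedy_cuts` are hypothetical.

```python
from itertools import combinations


def candidate_cuts(intervals):
    """Midpoints between adjacent distinct interval endpoints (assumed candidate set)."""
    points = sorted({p for low, high in intervals for p in (low, high)})
    return [(a + b) / 2 for a, b in zip(points, points[1:])]


def separates(cut, a, b):
    """True if one interval lies entirely below the cut and the other entirely above it."""
    return a[1] < cut < b[0] or b[1] < cut < a[0]


def greedy_cuts(intervals, labels):
    """Greedily choose partition points that discern objects with different decisions.

    Returns (chosen_cuts, unresolved_pairs). Pairs whose intervals overlap
    cannot be separated by any single point and are reported as unresolved.
    """
    # Every pair of objects with different decision labels must be discerned.
    pairs = {
        (i, j)
        for i, j in combinations(range(len(intervals)), 2)
        if labels[i] != labels[j]
    }
    cuts = candidate_cuts(intervals)
    chosen = []
    while pairs:
        best, best_covered = None, set()
        for cut in cuts:
            covered = {
                (i, j) for i, j in pairs
                if separates(cut, intervals[i], intervals[j])
            }
            if len(covered) > len(best_covered):
                best, best_covered = cut, covered
        if best is None:
            break  # no candidate separates any remaining pair
        chosen.append(best)
        pairs -= best_covered
    return sorted(chosen), pairs
```

For example, with intervals `[(1, 2), (2.5, 3), (4, 5), (4.5, 6)]` and labels `["a", "a", "b", "b"]`, a single partition point between 3 and 4 is enough to discern every pair of objects with different decisions. A greedy choice like this does not guarantee a minimum set of points in general, which is consistent with the abstract's remark that the exact selection problem is NP-hard and motivates a heuristic.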