Fast Randomized Algorithm For Minimal Test Cost Attribute Reduction

jingkuan li,fan min,william zhu
DOI: https://doi.org/10.1142/S0218539314500284
2014-01-01
International Journal of Reliability Quality and Safety Engineering
Abstract:Attribute reduction is a key data preprocessing technique, and has been widely studied in data mining, machine learning, and granular computing. Minimal test cost attribute reduction is one of important parts researched in cost-sensitive learning. The backtracking algorithm can obtain an optimal reduct, however on only small datasets due to the NP-hardness of the problem. Heuristic algorithms, such as the genetic one and the information gain based one, are employed to deal with this problem. In this paper, we propose the Fast Randomized Algorithm to obtain a satisfactory reduct more efficiently. The focus of the algorithm is a randomization mechanism that deals with attributes addition and deletion. There are two important parameters in the addition stage, namely the selecting probability of attributes and the number of selected attributes per batch. We obtain some appropriate parameter settings through experiments in a variety of datasets. Results show that the optimal settings of two parameters rarely change on different datasets. Our algorithm is more stable and significantly more efficient than existing heuristic ones.
What problem does this paper attempt to address?