Research on Method of Text Classification Rule Extraction Based on Genetic Algorithm and Entropy

TANG Hua,ZENG Bi-qing
DOI: https://doi.org/10.3321/j.issn:0529-6579.2007.05.005
2007-01-01
Abstract:Aimed at the text classification problems in data mining,a text classification rule extraction method is proposed based on genetic algorithm and entropy for rule discovery called Genetic-Miner(GM).The goal of GM is to discover classification rules in data sets.It produces population with the entropy and then extract classification rule with genetic algorithm.Compared the performance of GM with other two well-known algorithms Ant-miner and CN2 in six public domain data sets,the results showed that GM has a better performance in both predictive accuracy and rule list simplicity criteria than Ant-Miner and CN2.It can also mostly improve the comprehensibility of the discovered knowledge.
What problem does this paper attempt to address?