Distribution-Based Selective Classifiers for Incomplete Data

CHEN Jingnian,HUANG Houkuan,YANG Liping,TIAN Fengzhan
DOI: https://doi.org/10.3969/j.issn.1673-0291.2008.02.007
2008-01-01
Abstract:Selective classifiers are a kind of algorithms that can effectively improve the accuracy and efficiency of classification by deleting irrelevant or redundant attributes of a data set.Due to the complexity of processing incomplete data,however,most of them deal with complete data.Yet actual data are often incomplete and have many redundant or irrelevant attributes.a selective classifier for incomplete data(SDBNB),which is based on a newly constructed Bayes classifier(DBNB),is presented.Experiments results from twelve benchmark incomplete data sets show that the average accuracy of SDBNB is 0.69 percent and 0.58 percent higher than that of the effective selective classifiers: SNB and SRBC.Furthermore,its standard deviation is 0.11 and 0.05 lower than that of SNB and SRBC.This shows that not only SDBNB has higher accuracy,but also performs more stably as well.
What problem does this paper attempt to address?