Knowledge Discovery from Noisy Datasets

Hong Li, Yu Zong, Enhong Chen
DOI: https://doi.org/10.1007/978-3-642-25646-2_72
2011-01-01
Abstract: Dealing with noisy data is a significant challenge in data mining and knowledge discovery applications. Most previous work on data cleansing and correction has focused on addressing class noise or attribute noise for the benefit of the subsequent mining process. In this paper, we propose an error-sensitive (ES) data mining framework, which uses noise knowledge to restore the original data distributions and accommodates that knowledge to enhance data classification accuracy. We materialize our main idea by constructing an Attribute-Decision tree and measuring the correlation among attributes. Experimental results show that the ES data mining procedure can significantly improve the quality of data mining results.
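As a rough illustration of what "using noise knowledge to restore original data distributions" can mean, the sketch below estimates the pre-noise value distribution of a single categorical attribute from its noisy observations. It is not the authors' procedure: the function name `restore_distribution`, the uniform-replacement noise model, and the assumption that the noise rate is known in advance are all hypothetical choices made for this example.

```python
from collections import Counter


def restore_distribution(noisy_values, domain, noise_rate):
    """Estimate the pre-noise distribution of one categorical attribute.

    Assumed noise model (hypothetical, not taken from the paper): each value
    is kept with probability 1 - noise_rate and is otherwise replaced by one
    of the other k - 1 domain values, chosen uniformly at random.
    """
    k = len(domain)
    n = len(noisy_values)
    if k < 2 or n == 0:
        raise ValueError("need at least two domain values and one observation")

    # Probability that a corrupted record lands on one specific other value.
    flip = noise_rate / (k - 1)
    denom = 1.0 - noise_rate - flip
    if abs(denom) < 1e-12:
        raise ValueError("this noise rate makes the distribution unrecoverable")

    counts = Counter(noisy_values)
    # Observed frequency q relates to the original frequency p by
    #   q = (1 - noise_rate) * p + flip * (1 - p),
    # so p = (q - flip) / (1 - noise_rate - flip).
    estimate = {v: (counts[v] / n - flip) / denom for v in domain}

    # Sampling error can push an estimate below zero; clip and renormalize.
    clipped = {v: max(p, 0.0) for v, p in estimate.items()}
    total = sum(clipped.values())
    return {v: p / total for v, p in clipped.items()}


# Usage: 100 noisy records of a binary attribute with a known 20% noise rate.
observed = ["a"] * 65 + ["b"] * 35
restored = restore_distribution(observed, ["a", "b"], noise_rate=0.2)
```

Under this assumed model, an observed 65/35 split with a 20% noise rate corresponds to an original split of roughly 75/25. A real error-sensitive pipeline would also need noise knowledge per attribute and per class, which this sketch leaves out.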