QUALITY ASSESSMENT AND UNCERTAINTY HANDLING IN SPATIAL DATA MINING

Binbin He,Dazhi Guo
2004-01-01
Abstract:Spatial data mining refers to extracting or "mining" knowledge or interesting spatial / non- spatial patterns (rules) hidden in spatial databases, and usually the data in databases are characterized by large-amounts, incompleteness, noise, fuzziness and randomness. The most researches of spatial data mining focused on the method of data mining and its algorithms in the past years. Although uncertainties exist in spatial data mining, they had not been paid much attention to. The overall objective in this paper is to develop a framework for quality assessment and uncertainty handling in spatial data mining. For this purpose, we have adopted several quality assessment indices for the results of spatial data mining and a methodology for handling uncertainty in spatial data mining based on fuzzy logic and Dempster-Shafer theory. Firstly, the uncertainties at various stages of spatial data mining are briefly analyzed. And some kinds of quality assessment indices such as data clustering, classification and association rule mining are adopted. Then, the uncertainty in spatial data mining is probed. In this connection, fuzzy logic and Dempster- Shafer theory are used for the representation and handling of uncertainty in the process of spatial data mining.
What problem does this paper attempt to address?