An Efficient K-anonymization Algorithm Combining C-modes with MDAV.

Jian-min Han,Yu Juan,Huiqun Yu,Ting-ting Cen
DOI: https://doi.org/10.1109/GRC.2008.4664671
2008-01-01
Abstract:Individual privacy preservation has recently become an increasingly important issue when publishing microdata for mining purpose. K-anonymity is a popular model for protecting privacy, which requires that each record in the released dataset be indistinguishable with at least (k-1) other records with respect to quasi-identifier, MDAV an efficient k-anonymization algorithm, has been extensively investigated and applied. However MDAV's efficiency decreases dramatically with dataset size increasing. C-modes is an efficient clustering algorithm for large dataset, but which cannot realize k-anonymity. Combining C-Modes with MDAV we propose an efficient algorithm for large dataset k-anonymization problems. Experiments show that, compared with MDAV algorithm, the proposed algorithm increases efficiency dramatically especially for large dataset.
What problem does this paper attempt to address?