A Privacy-Preserving Classification Mining Algorithm

Weiping Ge,Wei Wang,Xiaorong Li,Baile Shi
DOI: https://doi.org/10.1007/11430919_32
2006-01-01
Journal of Computer Research and Development
Abstract:Privacy-preserving classification mining is one of the fast-growing sub-areas of data mining. How to perturb original data and then build a decision tree based on perturbed data is the key research challenge. By applying transition probability matrix this paper proposes a novel privacy-preserving classification mining algorithm which suits all data types, arbitrary probability distribution of original data, and perturbing all attributes (including label attribute). Experimental results demonstrate that decision tree built using this algorithm on perturbed data has comparable classifying accuracy to decision tree built using un-privacy-preserving algorithm on original data.
What problem does this paper attempt to address?