Abstract: With the explosive growth of big data, organizations are strongly encouraged to release their micro-data to support data-intensive analysis services, create new business opportunities, and enable scientific studies of every kind. However, releasing medical records about individuals can violate their privacy, so privacy-preserving data publishing has become a critical issue for companies and organizations. Existing anonymization techniques mainly operate on quasi-identifier attributes and do not consider the relationships between different values of a sensitive attribute, which can disclose individuals' private information. This paper studies the correlation between sensitive attribute values in detail, builds on the idea of protecting the original data by lossy join, and proposes the Twice-privacy algorithm, based on a utility matrix and multiattribute clustering. Twice-privacy clusters sensitive values to protect against similarity attacks and assigns different weights to quasi-identifier attributes to preserve them for query services; the data produced by the clustering algorithm are of high accuracy and high value. Experimental results on real datasets show the effectiveness and efficiency of the Twice-privacy algorithm. Our solution reduces the similarity attack rate to 0%. Meanwhile, the query correctness rate and analysis correctness rate of the proposed algorithm improve markedly, and query accuracy and analysis accuracy are also enhanced.
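The abstract reports a similarity attack rate without defining it. As a minimal sketch, assuming the common reading that an anonymized group is vulnerable when all of its sensitive values fall into one semantic category, the rate could be measured as below. The function name `similarity_attack_rate`, the `category_of` mapping, and the toy records are hypothetical illustrations, not taken from the paper.

```python
from collections import defaultdict


def similarity_attack_rate(records, category_of):
    """Return the fraction of groups whose sensitive values share one category.

    records: iterable of (group_id, sensitive_value) pairs (assumed layout).
    category_of: dict mapping each sensitive value to a semantic category.
    """
    categories_per_group = defaultdict(set)
    for group_id, sensitive_value in records:
        categories_per_group[group_id].add(category_of[sensitive_value])
    if not categories_per_group:
        return 0.0
    # A group is counted as vulnerable when only one category appears in it.
    vulnerable = sum(1 for cats in categories_per_group.values() if len(cats) == 1)
    return vulnerable / len(categories_per_group)


# Hypothetical toy data: two anonymized groups of two records each.
category_of = {
    "gastric ulcer": "stomach",
    "gastritis": "stomach",
    "flu": "respiratory",
    "bronchitis": "respiratory",
}
records = [
    (1, "gastric ulcer"),
    (1, "gastritis"),   # group 1 holds only stomach diseases
    (2, "flu"),
    (2, "gastritis"),   # group 2 mixes two categories
]
rate = similarity_attack_rate(records, category_of)
```

Under this assumed definition, a published table reaches a rate of 0% only when every group mixes at least two semantic categories of sensitive values.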

Utility-based Anonymisation for Dataset with Multiple Sensitive Attributes

Rating: Privacy Preservation for Multiple Attributes with Different Sensitivity Requirements

An Enhanced K-Anonymity Model Against Homogeneity Attack

A New K-anonymity Algorithm towards Multiple Sensitive Attributes

Utility-based Anonymization for Privacy Preservation with Less Information Loss

Anonymizing 1:M Microdata with High Utility

Privacy Protection on Multiple Sensitive Attributes

The K-Anonymization Method Satisfying Personalized Privacy Preservation

Privacy-preserving data publishing method for dataset with multi-dimensional sensitive attributes

Clustering-Based k-anonymity

Towards the Diversity of Sensitive Attributes in k-Anonymity

A Multi-phase K-anonymity Algorithm Based on Clustering Techniques

Local Generalization and Bucketization Technique for Personalized Privacy Preservation

A New K-anonymity Algorithm for Privacy Protection

Privacy-Preserving Data Publishing for Multiple Numerical Sensitive Attributes

Privacy Inference Attacking And Prevention On Multiple Relative K-Anonymized Microdata Sets

A Utility-aware Anonymization Model for Multiple Sensitive Attributes Based on Association Concealment

SLOMS: A Privacy Preserving Data Publishing Method for Multiple Sensitive Attributes Microdata

(α, k)-anonymity based privacy preservation by lossy join

Towards Optimal K-Anonymization

Privacy Protecting by Multiattribute Clustering in Data-Intensive Service