Grey Maximum Distance to Average Vector Based on Quasi-identifier Attribute

Hong Liu,Qishan Zhang,Kun Guo,Yingyi Wu
2018-01-01
Journal of grey system
Abstract:k-anonymity is an effective method of privacy preserving. MDAV (Maximum Distance to Average Vector) is one of the algorithms for k-anonymous model. MDAV uses Euclidean distance to measure the homogeneity between different records, which treats all attributes equally and covers the different importance of each attribute. However the importance of each attribute is different in actual situation and it needs to be treated differently. In this paper, from the perspective of qusi-identifier attribute, grey relational analysis is introduced into improve the measurement, a novel GMDAV (Grey Maximum Distance to Average Vector) is proposed for k-anonymous. For the close distance between tuples, considering the importance of the quasi-identity attribute and the similarity of the important attributes, a comprehensive measure method with the weighted Euclidean distance based on grey relational analysis is proposed to determine the importance of attributes. Based on the evaluation method of information loss model, according to the importance of attributes, an evaluation model of attribute information loss based on grey weight is put forward. By analyzing the time complexity of GMDAV, the results show that the proposed GMDAV can achieve the same time complexity as the traditional MDAV. Finally, three classical data sets were selected for the experiment. The experimental results show that GMDAV are all better than MDAV in the aspects of information loss and distance linked disclosure, GMDAV could effectively reduce the loss of information and reduce the overall information disclosure risk of the important attribute values.
What problem does this paper attempt to address?