A Trajectory K-Anonymity Model Based on Point Density and Partition

Wanshu Yu, Haonan Shi, Hongyun Xu
DOI: https://doi.org/10.48550/arXiv.2307.16849
2023-08-01
Abstract: As people's daily lives become increasingly inseparable from mobile electronic devices, service application platforms and network operators can easily collect large amounts of personal information. Releasing these data for scientific research or commercial purposes puts users' privacy at risk, especially when spatiotemporal trajectory datasets are published. To avoid leaking users' private information, the data must therefore be anonymized before release. However, simply removing individuals' unique identifiers is not enough to protect trajectory privacy, because attackers may infer users' identities by linking the data with other databases. Much work has been devoted to merging multiple trajectories to avoid re-identification, but these solutions sacrifice data quality to meet the anonymity requirement. To provide sufficient privacy protection for users' trajectory datasets, this paper studies trajectory privacy against re-identification attacks and proposes a trajectory K-anonymity model based on Point Density and Partition (KPDP). The approach improves on existing trajectory generalization anonymization techniques in two respects: trajectory set partition preprocessing and the trajectory clustering algorithm. It resists re-identification attacks and reduces the data utility loss of the k-anonymized dataset. A series of experiments on a real-world dataset show that the proposed model has significant advantages over existing techniques in terms of higher data utility and shorter algorithm execution time.
Cryptography and Security, Machine Learning
What problem does this paper attempt to address?
The paper addresses how to protect user privacy when releasing users' spatio-temporal trajectory data sets, so that malicious attackers cannot identify users from the trajectory data. Specifically, existing trajectory privacy-protection algorithms have difficulty measuring the similarity between trajectories accurately when the trajectories are distributed in irregular shapes. This leads to poor clustering and generalization, and therefore to relatively large information loss in the released data sets compared with the original data sets. The paper therefore proposes a trajectory K-anonymity model based on point density and partitioning (KPDP), which aims to improve the trajectory set preprocessing and trajectory clustering algorithms used in trajectory generalization anonymization, increasing data utility by reducing information loss while effectively resisting re-identification attacks.

The main contributions mentioned in the paper include:
- Proposing a trajectory K-anonymity model (KPDP) based on point density and partitioning, which addresses the high information loss that existing models suffer because of irregular trajectory-shape distributions and special data structures.
- Introducing an adaptive DBSCAN trajectory clustering algorithm, which uses clustering based on sample density to further enhance the data utility of anonymized trajectory data sets while achieving k-anonymity.
- Showing, through a series of experiments on real-world data sets, that the proposed model has significant advantages over other existing techniques in terms of information loss and algorithm execution time.

In summary, the paper focuses on how to protect users' privacy and security when spatio-temporal trajectory data are released while preserving data utility.
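To make the idea of density-based clustering for k-anonymity more concrete, here is a minimal Python sketch. It is not the paper's KPDP model or its adaptive DBSCAN: it assumes a fixed `eps` radius, equal-length trajectories of `(x, y)` points, a simple mean point-wise Euclidean distance, and generalization by replacing each cluster with its mean trajectory. It also omits the partition preprocessing step entirely. The names `traj_distance`, `dbscan`, and `anonymize` are hypothetical and chosen only for illustration.

```python
import math


def traj_distance(a, b):
    """Mean Euclidean distance between corresponding points.

    Assumption: both trajectories have the same number of points.
    """
    return sum(math.dist(p, q) for p, q in zip(a, b)) / len(a)


def dbscan(trajs, eps, min_samples):
    """Plain DBSCAN over trajectories; returns one label each, -1 = noise."""
    n = len(trajs)
    labels = [None] * n

    def neighbors(i):
        # Indices of all trajectories within eps of trajectory i (includes i).
        return [j for j in range(n) if traj_distance(trajs[i], trajs[j]) <= eps]

    cluster = -1
    for i in range(n):
        if labels[i] is not None:
            continue
        seeds = neighbors(i)
        if len(seeds) < min_samples:
            labels[i] = -1  # provisionally noise; may become a border point
            continue
        cluster += 1
        labels[i] = cluster
        queue = [j for j in seeds if j != i]
        while queue:
            j = queue.pop()
            if labels[j] == -1:
                labels[j] = cluster  # noise reachable from a core point: border
            if labels[j] is not None:
                continue
            labels[j] = cluster
            reachable = neighbors(j)
            if len(reachable) >= min_samples:
                queue.extend(reachable)  # j is a core point, keep expanding
    return labels


def anonymize(trajs, eps, k):
    """Release each trajectory as the mean trajectory of its cluster.

    Noise trajectories and clusters with fewer than k members are suppressed,
    so every released trajectory is shared by at least k records.
    """
    labels = dbscan(trajs, eps, min_samples=k)
    clusters = {}
    for idx, label in enumerate(labels):
        if label != -1:
            clusters.setdefault(label, []).append(idx)

    released = []
    for members in clusters.values():
        if len(members) < k:
            continue  # border points claimed elsewhere can leave a cluster short
        length = len(trajs[members[0]])
        centroid = [
            (
                sum(trajs[m][t][0] for m in members) / len(members),
                sum(trajs[m][t][1] for m in members) / len(members),
            )
            for t in range(length)
        ]
        released.extend([centroid] * len(members))
    return released, labels
```

Calling `anonymize(trajs, eps, k)` returns the generalized trajectories together with the cluster labels. Suppressing noise and undersized clusters is one simple way to satisfy the k-anonymity requirement, at the cost of discarding some records; the paper's adaptive clustering is described as reducing that kind of utility loss.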