A Survey on Differential Privacy for SpatioTemporal Data in Transportation Research

Rahul Bhadani
2024-07-18
Abstract: With low-cost computing devices, improved sensor technology, and the proliferation of data-driven algorithms, we have more data than we know what to do with. In transportation, we are seeing a surge in spatiotemporal data collection. At the same time, concerns over user privacy have led to research on differential privacy in applied settings. In this paper, we look at some recent developments in differential privacy in the context of spatiotemporal data. Spatiotemporal data contain not only features about users but also the geographical locations of their frequent visits. Hence, the public release of such data carries extreme risks. To address the need for such data in research and inference without exposing private information, significant work has been proposed. This survey paper aims to summarize these efforts and provide a review of differential privacy mechanisms and related software. We also discuss related work in transportation where such mechanisms have been applied. Furthermore, we address the challenges in the deployment and mass adoption of differential privacy in transportation spatiotemporal data for downstream analyses.
Cryptography and Security, Computers and Society, Machine Learning, Methodology
What problem does this paper attempt to address?
The main problem this paper addresses is **how to protect user privacy when releasing spatiotemporal data in transportation research**. With the spread of low-cost computing devices, improved sensor technology, and data-driven algorithms, large amounts of spatiotemporal data are now being collected. These data contain not only user characteristics but also the geographical locations users visit frequently, so releasing them publicly can expose private information and carries serious privacy risks.

To meet this challenge, the paper focuses on applying **Differential Privacy (DP)** to spatiotemporal data. Differential privacy is a mathematical framework that aims to ensure the result of a data analysis does not change significantly because of the presence or absence of any single record, thereby protecting individual privacy. The paper reviews recent developments in differential privacy for spatiotemporal data, surveys the relevant mechanisms and software, and discusses examples of their application in transportation.

The paper also points out challenges in applying differential privacy in transportation, such as:

1. **High correlation in spatiotemporal data**: If perturbations are synthesized without considering spatial or temporal information, the result may contain outliers and undermine the effectiveness of differential privacy.
2. **Handling high-dimensional data**: Spatiotemporal data in transportation usually come from multiple sensor modalities, and the dimensions may be correlated with one another. Traditional methods may not be sufficient to handle this complexity.
3. **Application in autonomous driving**: There is currently no evidence that differentially private data have been used successfully for autonomous driving operations, which remains a pressing open problem.
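To make the idea of "the result does not change significantly when one record is added or removed" concrete, here is a minimal sketch of the Laplace mechanism applied to a counting query. It is an illustration, not a method taken from the paper: the toy `trips` data, the function names `laplace_noise` and `private_count`, and the choice of `epsilon` are all assumptions, and the sensitivity of 1 holds only if each user contributes at most one record.

```python
import random


def laplace_noise(scale, rng):
    # The difference of two i.i.d. exponentials with mean `scale`
    # is distributed as Laplace(0, scale).
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def private_count(records, predicate, epsilon, rng=None):
    """Return a noisy count using the Laplace mechanism.

    Assumes each individual contributes at most one record, so adding or
    removing one person changes the true count by at most 1 (sensitivity 1).
    """
    if epsilon <= 0:
        raise ValueError("epsilon must be positive")
    rng = rng or random.Random()
    true_count = sum(1 for r in records if predicate(r))
    # Noise scale = sensitivity / epsilon = 1 / epsilon.
    return true_count + laplace_noise(1.0 / epsilon, rng)


# Hypothetical toy data: one (user_id, zone) visit per user.
trips = [("u1", "A"), ("u2", "A"), ("u3", "B"), ("u4", "A")]

# Noisy number of users seen in zone "A"; the true count is 3.
noisy = private_count(trips, lambda t: t[1] == "A", epsilon=0.5,
                      rng=random.Random(0))
print(noisy)
```

A smaller `epsilon` adds more noise and gives stronger privacy. This independent per-query noise ignores the spatial and temporal correlation discussed in challenge 1, which is one reason such a simple approach is usually not enough for trajectory data.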
In summary, the paper aims to summarize progress in applying differential privacy to spatiotemporal data, provide a reference for researchers, and promote the further development of differential privacy in transportation.