Protecting Activity Sensing Data Privacy Using Hierarchical Information Dissociation

Guangjing Wang,Hanqing Guo,Yuanda Wang,Bocheng Chen,Ce Zhou,Qiben Yan
2024-09-04
Abstract:Smartphones and wearable devices have been integrated into our daily lives, offering personalized services. However, many apps become overprivileged as their collected sensing data contains unnecessary sensitive information. For example, mobile sensing data could reveal private attributes (e.g., gender and age) and unintended sensitive features (e.g., hand gestures when entering passwords). To prevent sensitive information leakage, existing methods must obtain private labels and users need to specify privacy policies. However, they only achieve limited control over information disclosure. In this work, we present Hippo to dissociate hierarchical information including private metadata and multi-grained activity information from the sensing data. Hippo achieves fine-grained control over the disclosure of sensitive information without requiring private labels. Specifically, we design a latent guidance-based diffusion model, which generates multi-grained versions of raw sensor data conditioned on hierarchical latent activity features. Hippo enables users to control the disclosure of sensitive information in sensing data, ensuring their privacy while preserving the necessary features to meet the utility requirements of applications. Hippo is the first unified model that achieves two goals: perturbing the sensitive attributes and controlling the disclosure of sensitive information in mobile sensing data. Extensive experiments show that Hippo can anonymize personal attributes and transform activity information at various resolutions across different types of sensing data.
Cryptography and Security,Artificial Intelligence,Machine Learning
What problem does this paper attempt to address?
The problem that this paper attempts to solve is: **How to protect user privacy and prevent the leakage of sensitive information without compromising the utility of mobile device activity - sensing data**. Specifically, the paper focuses on the privacy issues of sensor data in smart phones and wearable devices. ### Problem Background 1. **Metadata - level over - privilege problem**: - Activity - sensing data contains redundant personal attribute information (such as age, gender). This information is unnecessary for the activity - sensing function but has privacy risks. - Attackers can use these unique features to infer users' sensitive attributes. 2. **Feature - level over - privilege problem**: - Applications may collect more fine - grained activity features than actually needed, which may lead to the leakage of users' sensitive behaviors. - For example, when a smart watch is used for fall detection, the collected sensor data may also record hand movements, thus revealing the user's password - input gestures. ### Limitations of Existing Methods - **Adversarial Training**: It requires labeled private data for model training, which is difficult to obtain. - **Noise Addition**: Although it can disrupt sensitive information, it will damage the utility of data. - **Filtering Mechanism**: Existing methods either completely release or remove data segments, resulting in damaged data utility. ### Hippo's Solution To address the above problems, the paper proposes the **Hippo** system to protect user privacy in the following ways: 1. **Hierarchical Information Decoupling**: - Hippo uses a Latent Feature Guidance - based Diffusion Model to decouple hierarchical information, including private metadata and multi - granularity activity information. - This method can achieve fine - grained control of sensitive information without relying on private labels. 2. **Multi - granularity Data Generation**: - Hippo reconstructs the original multi - granularity sensor data, allowing users to selectively remove the sensitive information they do not wish to disclose while retaining the necessary features to meet the application's requirements. - For example, users can choose to only disrupt metadata information or remove high - resolution gesture features while retaining coarse - grained activity information. 3. **No Need for Private Labels**: - Hippo does not require users to provide private labels or sensitive data for training, thereby reducing privacy risks. ### Experimental Verification The paper verifies the effectiveness and practicality of Hippo through extensive experiments: - **Metadata - level Protection**: Hippo can reduce the probability of private attribute inference to 50% while maintaining the same activity recognition accuracy as the original data. - **Feature - level Protection**: Hippo can generate activity data of different granularities, preventing the leakage of sensitive activity information without affecting the utility of data. In conclusion, Hippo is an innovative system that aims to protect user privacy and prevent the leakage of sensitive information by reconstructing multi - granularity activity - sensing data while ensuring that the utility of data is not affected.