SAFELOC: Overcoming Data Poisoning Attacks in Heterogeneous Federated Machine Learning for Indoor Localization

Akhil Singampalli,Danish Gufran,Sudeep Pasricha
2024-11-14
Abstract:Machine learning (ML) based indoor localization solutions are critical for many emerging applications, yet their efficacy is often compromised by hardware/software variations across mobile devices (i.e., device heterogeneity) and the threat of ML data poisoning attacks. Conventional methods aimed at countering these challenges show limited resilience to the uncertainties created by these phenomena. In response, in this paper, we introduce SAFELOC, a novel framework that not only minimizes localization errors under these challenging conditions but also ensures model compactness for efficient mobile device deployment. Our framework targets a distributed and co-operative learning environment that uses federated learning (FL) to preserve user data privacy and assumes heterogeneous mobile devices carried by users (just like in most real-world scenarios). Within this heterogeneous FL context, SAFELOC introduces a novel fused neural network architecture that performs data poisoning detection and localization, with a low model footprint. Additionally, a dynamic saliency map-based aggregation strategy is designed to adapt based on the severity of the detected data poisoning scenario. Experimental evaluations demonstrate that SAFELOC achieves improvements of up to 5.9x in mean localization error, 7.8x in worst-case localization error, and a 2.1x reduction in model inference latency compared to state-of-the-art indoor localization frameworks, across diverse building floorplans, mobile devices, and ML data poisoning attack scenarios.
Machine Learning,Artificial Intelligence,Cryptography and Security
What problem does this paper attempt to address?
The problem that this paper attempts to solve is: in a heterogeneous federated machine - learning environment, how to overcome the impact of data - poisoning attacks on indoor positioning systems. Specifically, the paper focuses on: 1. **Device heterogeneity**: Due to differences in hardware, software, and firmware of different mobile devices, there is additional variability in the received Wi - Fi signal strength (RSS) measurements, which makes it difficult for machine - learning models to maintain accurate predictions. 2. **Data - poisoning attacks**: Malicious clients deliberately inject tampered data into the learning process of the global model (GM), resulting in a significant decline in the performance of the global model and thus affecting the accuracy of indoor positioning. To address these problems, the authors propose a new framework named SAFELOC, aiming to: - Minimize positioning errors in the presence of device heterogeneity and data - poisoning attacks. - Ensure that the model is compact for efficient deployment on resource - limited mobile devices. - Use federated learning (FL) to protect user data privacy and assume that the mobile devices carried by users are heterogeneous, similar to most real - world scenarios. By introducing a new fused neural - network architecture and a saliency - map - based aggregation strategy, SAFELOC can not only detect and remove data - poisoning but also perform efficient positioning tasks, thereby improving the robustness and accuracy of the system. ### Specific problem summary: - How to effectively detect and defend against data - poisoning attacks in a heterogeneous federated - learning environment? - How to ensure that the model operates efficiently on resource - constrained mobile devices while maintaining high - precision indoor - positioning capabilities? Through these improvements, SAFELOC has achieved significant performance improvements in various building floor plans, mobile devices, and ML data - poisoning - attack scenarios, including a 5.9 - fold reduction in the average positioning error, a 7.8 - fold reduction in the worst - case positioning error, and a 2.1 - fold reduction in model - inference latency.