FedSub: Introducing class-aware Subnetworks Fusion to Enhance Personalized Federated Learning in Ubiquitous Systems

Mattia Giovanni Campana,Franca Delmastro
2024-11-13
Abstract:Personalized Federated Learning is essential in AI-driven ubiquitous systems, supporting the distributed development of models able to adapt to diverse and evolving user behaviors while safeguarding privacy. Despite addressing heterogeneous user data distributions in collaborative model training, existing methods often face limitations balancing personalization and generalization, oversimplifying user similarities, or relying heavily on global models. In this paper, we propose FedSub, a novel federated approach designed to enhance personalization through the use of class-aware prototypes and model subnetworks. Prototypes serve as compact representations of user data, clustered on the server to identify similarities based on specific label patterns. Concurrently, subnetworks -- model components necessary to process each class -- are extracted locally and fused by the server according to these clusters, producing highly tailored model updates for each user. This fine-grained, class-specific aggregation of clients' models allows FedSub to capture the unique characteristics of individual user data patterns. The effectiveness of FedSub is validated in three real-world scenarios characterized by high data heterogeneity, derived from human activity recognition and mobile health applications. Experimental evaluations demonstrate FedSub's performance improvements with respect to the state-of-the-art and significant advancements in personalization for ubiquitous systems based on personal mobile and wearable devices.
Machine Learning
What problem does this paper attempt to address?
### What problems does this paper attempt to solve? This paper aims to solve the balance problem between personalization and generalization in Federated Learning (FL), especially in the case of non - independent and identically distributed (non - IID) data distribution. Specifically: 1. **Challenges of personalization and generalization**: - Existing FL methods have limitations when dealing with data heterogeneity and it is difficult to find a balance between personalization and generalization. - Standard FL methods usually rely on a single global model, which may lead to the inability to capture the unique data patterns of each client, especially in large - scale settings. 2. **Limitations of existing methods**: - **Transfer learning**: Although it is possible to fine - tune the model for each client, it is difficult to balance personalization and generalization in large - scale settings. - **Clustering methods**: It is assumed that the similarity of model parameters reflects the similarity of user data, but this method often oversimplifies the problem because clients may only share similarities on certain labels while having significant differences on other labels. 3. **Challenges of non - IID data**: - In many real - world scenarios, the data distributions generated by different users' devices vary greatly. For example, in mobile health (mHealth) applications and human activity recognition (HAR), the ways in which different users perform the same activity may vary due to individual characteristics, age, and habits. - A single global model is usually inefficient in this case and may lead to sub - optimal performance. To solve these problems, the paper proposes a new algorithm - FedSub, which enhances personalized federated learning by introducing two key concepts: - **Class - aware Prototypes**: Used for compactly representing users' class - level data and clustering on the server side to identify similarities based on specific label patterns. - **Subnetworks**: Extract the necessary components in the local model for handling each class, and fuse these subnetworks on the server side according to the clustering results, thereby generating highly customized model updates. In this way, FedSub can better adapt to diverse user data patterns while maintaining personalization, especially in environments with high data heterogeneity and device heterogeneity. Experimental results show that FedSub significantly improves classification accuracy and model convergence speed in multiple real - world application scenarios and provides a high level of personalized support.