Federating from History in Streaming Federated Learning

Ruirui Zhang,Yifei Zou,Zhenzhen Xie,Xiao Zhang,Peng Li,Zhipeng Cai,Xiuzhen Cheng,Dongxiao Yu
DOI: https://doi.org/10.1145/3641512.3686375
2024-01-01
Abstract:To address the online learning problem in distributed systems, Streaming Federated learning (SFL) enables immediate model training by clients upon collecting new data, finding wide applications in AI-enabled Internet-of-Things and sensor networks. Given the variability in data distribution across different historical periods, the ability to recall and rapidly apply previously encountered data distributions significantly enhances the efficiency and accuracy of model training. In this paper, a demo based on the real-world temperature datasets is presented to demonstrate the importance of history knowledge in local training and the federating process of SFL, which also shows that vanilla federated learning without considering the history knowledge may even be harmful to model training. Observing this, we propose Fed-HIST, a Federated learning framework that enables the clients to learn from the HISTory knowledge of the whole distributed learning system. Unlike direct raw data storage, Fed-HIST employs model architectures to capture the data distributions, offering a more space-efficient and privacy-preserving method of knowledge storage on a server pool. Additionally, a model similarity comparison scheme is designed to retrieve beneficial knowledge from the pool uploaded by the clients in the past. Such a history-aware federation can enhance the efficiency of training each client, only requiring the recurrence of similar data distributions among SFL participants. We validate our framework through extensive simulations on MNIST, Fashion-MINST, CIFAR10, and CIFAR100 datasets, benchmarking against 9 baselines and highlighting the importance of federating from history in SFL problem through necessary ablation studies.
What problem does this paper attempt to address?