Abstract:Federated learning (FL) is important for privacy-preserving services by training models without collecting raw user data. Most FL algorithms assume all data is annotated, which is impractical due to the high cost of labeling data in real applications. To alleviate the reliance on labeled data, semi-supervised federated learning (SSFL) has been proposed to utilize unlabeled data on clients to improve model performance. However, most existing methods either have privacy issues which share models trained on other clients, or generate pseudo-labels for unlabeled local datasets with the global model, which is usually biased towards the global data distribution. The latter may lead to sub-optimal accuracy of pseudo-labels, due to the gap between the local data distribution and the global model, especially in non-IID settings. In this paper, we propose a semi-supervised heterogeneous federated learning method with local knowledge enhancement, called FedLoKe, which aims to train an accurate global model from both labeled and unlabeled local data with non-IID distributions. Specifically, in FedLoKe, the server maintains a global model to capture global data distribution, and each client learns a local model to capture local data distribution. Since the distribution captured by the local model is aligned with the local data distribution, we utilize it to generate high-accuracy pseudo-labels of the unlabeled dataset for global model training. To prevent the local model from severely overfitting local labeled data, we further use the exponential moving average and apply the global model to generate pseudo-labels for local modeling training. Experiments on four datasets show the effectiveness of FedLoKe. Our code is available at: https://github.com/zcfinal/FedLoKe.

A Local Online Learning Approach for Non-linear Data.

An On-Line Learning Approach with Support Vector Dormain Classifier

Adaptive Local Kernel-Based Learning for Soft Sensor Modeling of Nonlinear Processes

Online Kernel Learning with a Near Optimal Sparsity Bound

Large Scale Online Kernel Classification

Local least squares support vector regression with application to online modeling for batch processes

Local Parameter Optimization of LSSVM for Industrial Soft Sensing with Big Data and Cloud Implementation

Online monitoring of nonlinear multiple mode processes based on adaptive local model approach

Optimizing Federated Learning on Non-IID Data Using Local Shapley Value.

Learning nonlinear manifolds based on mixtures of localized linear manifolds under a self-organizing framework

Exact Soft Confidence-Weighted Learning

A Framework of Sparse Online Learning and Its Applications

Learning a Coupled Linearized Method in Online Setting.

Nonlinear Online Learning with Adaptive Nyström Approximation

Optimal Learning Rates for Localized SVMs

Non-IID always Bad? Semi-Supervised Heterogeneous Federated Learning with Local Knowledge Enhancement

Efficient Online Learning for Large-Scale Sparse Kernel Logistic Regression

Local Learning Matters: Rethinking Data Heterogeneity in Federated Learning

A novel framework for online supervised learning with feature selection

Memory-Efficient LLM Training with Online Subspace Descent

A Multilayer Framework for Online Metric Learning