Abstract: Federated learning (FL) has emerged as a prominent approach for collaboratively training machine learning models across distributed clients while preserving data privacy. However, balancing acceleration and stability remains a significant challenge in FL, especially on the client side. In this paper, we introduce FedCAda, an innovative federated client adaptive algorithm designed to tackle this challenge. FedCAda builds on the Adam algorithm: it adjusts the correction process of the first moment estimate $m$ and the second moment estimate $v$ on the client side and aggregates the adaptive algorithm parameters on the server side, aiming to accelerate convergence and improve communication efficiency while ensuring stability and performance. Additionally, we investigate several variants of the algorithm that use different adjustment functions. This comparative analysis reveals that, because client models contain limited information from other clients during the initial stages of federated learning, stronger constraints need to be imposed on the parameters of the adaptive algorithm at that point. As federated learning progresses and clients gather more global information, FedCAda gradually reduces its influence on the adaptive parameters. These findings provide insights for enhancing the robustness and efficiency of algorithmic improvements. Through extensive experiments on computer vision (CV) and natural language processing (NLP) datasets, we demonstrate that FedCAda outperforms state-of-the-art methods in terms of adaptability, convergence, stability, and overall performance. This work contributes to adaptive algorithms for federated learning and encourages further exploration.

Efficient Federated Learning for Modern NLP

Efficient Federated Learning with Pre-Trained Large Language Model Using Several Adapter Mechanisms

Communication Efficient Federated Learning for Multilingual Neural Machine Translation with Adapter

Dual-Personalizing Adapter for Federated Foundation Models

FewFedWeight: Few-shot Federated Learning Framework Across Multiple NLP Tasks

FL-TAC: Enhanced Fine-Tuning in Federated Learning via Low-Rank, Task-Specific Adapter Clustering

Towards Practical Few-shot Federated NLP

FedPT: Federated Proxy-Tuning of Large Language Models on Resource-Constrained Edge Devices

FedCAda: Adaptive Client-Side Optimization for Accelerated and Stable Federated Learning

Towards Efficient Model-Heterogeneity Federated Learning for Large Models

Federated Few-Shot Learning for Mobile NLP

FedMCP: Parameter-Efficient Federated Learning with Model-Contrastive Personalization

Efficient Federated Finetuning of Tiny Transformers with Resource-Constrained Devices

Unlocking FedNL: Self-Contained Compute-Optimized Implementation

FedLion: Faster Adaptive Federated Optimization with Fewer Communication

Automated Federated Pipeline for Parameter-Efficient Fine-Tuning of Large Language Models

Why Go Full? Elevating Federated Learning Through Partial Network Updates

Conquering the Communication Constraints to Enable Large Pre-Trained Models in Federated Learning

FedPFT: Federated Proxy Fine-Tuning of Foundation Models

FedBiOT: LLM Local Fine-tuning in Federated Learning without Full Model

Efficient Federated Learning via Local Adaptive Amended Optimizer with Linear Speedup