FLoCoRA: Federated learning compression with low-rank adaptation

Lucas Grativol Ribeiro,Mathieu Leonardon,Guillaume Muller,Virginie Fresse,Matthieu Arzel

2024-06-20

Abstract:Low-Rank Adaptation (LoRA) methods have gained popularity in efficient parameter fine-tuning of models containing hundreds of billions of parameters. In this work, instead, we demonstrate the application of LoRA methods to train small-vision models in Federated Learning (FL) from scratch. We first propose an aggregation-agnostic method to integrate LoRA within FL, named FLoCoRA, showing that the method is capable of reducing communication costs by 4.8 times, while having less than 1% accuracy degradation, for a CIFAR-10 classification task with a ResNet-8. Next, we show that the same method can be extended with an affine quantization scheme, dividing the communication cost by 18.6 times, while comparing it with the standard method, with still less than 1% of accuracy loss, tested with on a ResNet-18 model. Our formulation represents a strong baseline for message size reduction, even when compared to conventional model compression works, while also reducing the training memory requirements due to the low-rank adaptation.

Machine Learning,Signal Processing

What problem does this paper attempt to address?

The paper aims to address the issues of communication and computational efficiency in Federated Learning (FL). Specifically, the authors propose a new method called FLoCoRA (Federated Learning Compression with Low-Rank Adaptation), which utilizes Low-Rank Adaptation (LoRA) technology to train small visual models. Compared to traditional federated learning, this method can significantly reduce communication costs while achieving this goal with less than a 1% drop in accuracy. The main contributions of the paper include: 1. **Proposing the FLoCoRA framework**: Integrating LoRA technology into the federated learning framework to make model training more efficient. By training and transmitting only the LoRA adapter parameters instead of the entire model, communication overhead is greatly reduced. 2. **Studying the impact of hyperparameters**: Investigating the impact of LoRA hyperparameters on classification tasks and identifying the settings that achieve optimal performance under different configurations. 3. **Introducing a quantization scheme**: Combining affine quantization techniques to further compress communication data. Experiments show that even after quantization, accuracy remains at a high level. 4. **Experimental validation**: Conducting extensive experiments to validate the effectiveness of FLoCoRA, particularly its performance on ResNet-8 and ResNet-18 models, demonstrating its ability to maintain high accuracy while reducing communication costs. In summary, FLoCoRA provides an effective method to reduce communication overhead in federated learning while maintaining model performance, offering new possibilities for federated learning deployment on edge devices.

FLoCoRA: Federated learning compression with low-rank adaptation

Federated LoRA with Sparse Communication

PC-LoRA: Low-Rank Adaptation for Progressive Model Compression with Knowledge Distillation

Improving LoRA in Privacy-preserving Federated Learning

Selective Aggregation for Low-Rank Adaptation in Federated Learning

Exploring Gradient Subspaces: Addressing and Overcoming LoRA's Limitations in Federated Fine-Tuning of Large Language Models

Towards Federated Low-Rank Adaptation with Rank-Heterogeneous Communication

FLoRA: Federated Fine-Tuning Large Language Models with Heterogeneous Low-Rank Adaptations

CoRA: Optimizing Low-Rank Adaptation with Common Subspace of Large Language Models

LoRA-FAIR: Federated LoRA Fine-Tuning with Aggregation and Initialization Refinement

Batched Low-Rank Adaptation of Foundation Models

LoRA-XS: Low-Rank Adaptation with Extremely Small Number of Parameters

Federated LLMs Fine-tuned with Adaptive Importance-Aware LoRA

Exact Aggregation for Federated and Efficient Fine-Tuning of Foundation Models

LoRA-FA: Memory-efficient Low-rank Adaptation for Large Language Models Fine-tuning

FDLoRA: Personalized Federated Learning of Large Language Model via Dual LoRA Tuning

LoRA+: Efficient Low Rank Adaptation of Large Models

RBLA: Rank-Based-LoRA-Aggregation for Fine-tuning Heterogeneous Models in FLaaS

Towards Robust and Efficient Federated Low-Rank Adaptation with Heterogeneous Clients

LoRA Learns Less and Forgets Less

ResLoRA: Identity Residual Mapping in Low-Rank Adaption