Synergizing Foundation Models and Federated Learning: A Survey

Shenghui Li, Fanghua Ye, Meng Fang, Jiaxu Zhao, Yun-Hin Chan, Edith C.-H. Ngai, Thiemo Voigt
2024-06-19
Abstract: The recent development of Foundation Models (FMs), represented by large language models, vision transformers, and multimodal models, has been making a significant impact on both academia and industry. Compared with small-scale models, FMs have a much stronger demand for high-volume data during the pre-training phase. Although general FMs can be pre-trained on data collected from open sources such as the Internet, domain-specific FMs need proprietary data, posing a practical challenge regarding the amount of data available due to privacy concerns. Federated Learning (FL) is a collaborative learning paradigm that breaks the barrier of data availability from different participants. Therefore, it provides a promising solution to customize and adapt FMs to a wide range of domain-specific tasks using distributed datasets whilst preserving privacy. This survey paper discusses the potentials and challenges of synergizing FL and FMs and summarizes core techniques, future directions, and applications. A periodically updated paper collection on FM-FL is available at https://github.com/lishenghui/awesome-fm-fl.
Machine Learning, Artificial Intelligence
What problem does this paper attempt to address?
This paper examines how to combine Federated Learning (FL) and Foundation Models (FMs) to address the privacy and data-availability challenges of training high-quality domain-specific models. Specifically:
1. **Privacy protection**: In fields such as law, medicine, and finance, data is usually highly sensitive and therefore difficult to collect and use centrally. Conventional training requires storing and processing large amounts of data in one place, which creates a risk of privacy leakage. FL mitigates this risk by letting participants train a model collaboratively without sharing their raw data.
2. **Data availability**: Domain-specific FMs require large amounts of proprietary data for pre-training or fine-tuning, but privacy restrictions often make this data hard to obtain. By drawing on decentralized data sources, FL can widen the pool of usable data without compromising privacy, which in turn improves the diversity and robustness of the model.
3. **Model adaptability**: FMs usually need fine-tuning to perform well on specific tasks. In an FL setting, however, data distributions are heterogeneous and participants differ in computing resources, so applying conventional fine-tuning methods directly may degrade performance. How to fine-tune FMs effectively within the FL framework, so that they adapt to different downstream tasks and device characteristics, is therefore one focus of the paper.
4. **Resource efficiency**: FMs usually have a large number of parameters and incur high training and communication costs. In an FL setting, especially in resource-constrained scenarios such as mobile devices, reducing the computation and communication overhead of training FMs is another open problem.
In summary, this paper explores how to combine the strengths of FL and FMs to address the challenges above, so that domain-specific models can be developed under privacy protection and the related techniques can advance.
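The collaboration pattern described in item 1, where participants train a shared model without exchanging raw data, can be illustrated with a small sketch. It uses federated averaging on a two-parameter linear model, chosen here only for illustration; the summary above does not name a specific aggregation algorithm. The function names, the synthetic data, and the hyperparameters are all hypothetical, and a real FM would have far more parameters than this toy model.

```python
import random


def local_update(weights, data, lr=0.05, epochs=5):
    """Run a few epochs of gradient descent on one client's private data.

    Only the updated weights leave the client; `data` is never sent.
    """
    w, b = weights
    n = len(data)
    for _ in range(epochs):
        grad_w = grad_b = 0.0
        for x, y in data:
            err = (w * x + b) - y
            grad_w += 2 * err * x
            grad_b += 2 * err
        w -= lr * grad_w / n
        b -= lr * grad_b / n
    return (w, b)


def federated_average(client_weights, client_sizes):
    """Server step: average client weights, weighted by local dataset size."""
    total = sum(client_sizes)
    w = sum(cw[0] * n for cw, n in zip(client_weights, client_sizes)) / total
    b = sum(cw[1] * n for cw, n in zip(client_weights, client_sizes)) / total
    return (w, b)


def make_client_data(rng, n, x_low, x_high):
    """Synthetic private data from y = 3x + 1 plus small noise (illustrative)."""
    xs = [rng.uniform(x_low, x_high) for _ in range(n)]
    return [(x, 3.0 * x + 1.0 + rng.gauss(0, 0.01)) for x in xs]


def run_federated_training(rounds=200, seed=0):
    rng = random.Random(seed)
    # Each client sees a different input range, a mild form of data heterogeneity.
    clients = [
        make_client_data(rng, 20, -1.0, 0.0),
        make_client_data(rng, 40, 0.0, 1.0),
        make_client_data(rng, 30, -0.5, 0.5),
    ]
    sizes = [len(d) for d in clients]
    global_weights = (0.0, 0.0)
    for _ in range(rounds):
        # Every client starts from the current global model and trains locally.
        updates = [local_update(global_weights, d) for d in clients]
        # The server only ever sees model weights, never the raw (x, y) pairs.
        global_weights = federated_average(updates, sizes)
    return global_weights


if __name__ == "__main__":
    print(run_federated_training())
```

Under these assumptions, `run_federated_training()` should return a slope and intercept close to the values used to generate the data, even though no client ever shares its samples. The sketch also hints at item 4: each round transmits every model parameter in both directions, which is negligible for two parameters but becomes a dominant cost at FM scale.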