FedAWS: A Federated Tuning Approach with Adaptive Weight Shrinking for Pre-trained Foundation Models

Yubin Cheng,Zhihuan Song,Xiaoyu Jiang,Le Yao
DOI: https://doi.org/10.1109/ddcls61622.2024.10606783
2024-01-01
Abstract:In recent years, pre-trained foundation models have developed rapidly and have been applied to various downstream tasks through fine-tuning methods. However, due to the existence of data silos, the lack of data for fine-tuning has become a vital issue preventing the application of the foundation model to industry. In addition, the high complexity of the network makes the foundation model prone to overfitting when used for data-constrained industrial tasks. Therefore, a federated tuning approach with adaptive weight shrinking, named FedAWS, is proposed for collaborative fine-tuning between factories. FedAWS adaptively selects the weight-shrinking factor to reduce overfitting and improve the generalization of the global model without relying on additional data sets or human experience. The case study on an industrial dataset demonstrates the effectiveness of FedAWS. This successful validation provides a new way for inter-factory collaboration to apply the foundation model to industry, thus facilitating the efficient utilization of industrial data.
What problem does this paper attempt to address?