Combined Federated and Split Learning in Edge Computing for Ubiquitous Intelligence in Internet of Things: State of the Art and Future Directions

Qiang Duan,Shijing Hu,Ruijun Deng,Zhihui Lu
DOI: https://doi.org/10.48550/arXiv.2207.09611
2022-07-20
Abstract:Federated learning (FL) and split learning (SL) are two emerging collaborative learning methods that may greatly facilitate ubiquitous intelligence in Internet of Things (IoT). Federated learning enables machine learning (ML) models locally trained using private data to be aggregated into a global model. Split learning allows different portions of an ML model to be collaboratively trained on different workers in a learning framework. Federated learning and split learning, each has unique advantages and respective limitations, may complement each other toward ubiquitous intelligence in IoT. Therefore, combination of federated learning and split learning recently became an active research area attracting extensive interest. In this article, we review the latest developments in federated learning and split learning and present a survey on the state-of-the-art technologies for combining these two learning methods in an edge computing-based IoT environment. We also identify some open problems and discuss possible directions for future research in this area with a hope to further arouse the research community's interest in this emerging field.
Machine Learning,Distributed, Parallel, and Cluster Computing
What problem does this paper attempt to address?
The problem that this paper attempts to solve is: How to combine Federated Learning (FL) and Split Learning (SL) in the Internet of Things (IoT) environment to achieve Ubiquitous Intelligence. Specifically, the paper focuses on how to overcome the challenges in achieving ubiquitous intelligence in the edge - computing - supported IoT environment by combining these two collaborative learning methods. ### Problem Background 1. **Data Scatter** : - The amount of data generated by IoT devices is huge and highly scattered. Traditional centralized cloud infrastructures cannot meet the requirements for real - time performance and privacy protection. - Transmitting data to the cloud not only consumes bandwidth but may also introduce additional latency, and data related to user privacy needs to be retained on local devices. 2. **Resource Limitations** : - Edge devices usually have limited computing and network resources and it is difficult for them to independently complete the training of complex machine - learning models, especially complex models such as deep neural networks. ### Solutions - **Federated Learning (FL)** : - It allows multiple data owners to collaboratively train a global model without exposing their private data. Each client uses local data to train a local model and then uploads the model parameters to the server for aggregation. - Advantage: Protects data privacy and reduces communication overhead. - Limitation: It requires client devices to have sufficient resources to train a complete model, which is a challenge for resource - limited IoT devices. - **Split Learning (SL)** : - It divides a machine - learning model into multiple parts, and different parts can be collaboratively trained on different nodes. The client executes the front - end part of the model and sends the intermediate results to the server, and the server continues the training of the back - end part and returns the gradients. - Advantage: Allows the client to offload part of the training tasks to the server, making full use of the flexible resource management in edge computing. - Limitation: The sequential collaboration between the client and the server limits the effective use of large - scale distributed data. ### Advantages of Combining FL and SL - **Complementary Advantages** : - FL solves the problems of data privacy and communication efficiency, while SL alleviates the resource limitation problem. The combination of the two can give full play to their respective advantages and at the same time make up for their respective deficiencies. - For example, on resource - limited IoT devices, SL can be used to offload computationally intensive tasks to edge servers, and in terms of data privacy protection, FL can be relied on to ensure that data does not leave the local area. ### Main Contributions of the Paper - **Latest Progress Review** : - This article reviews the latest developments of FL and SL and provides a survey of the current state of technology on how to combine these two methods. - **Open Problems and Future Directions** : - It identifies some unsolved problems and discusses possible directions for future research, aiming to further stimulate the research community's interest in this emerging field. In this way, the paper provides researchers with a comprehensive perspective to help them understand how to achieve more efficient ubiquitous intelligence in the edge - computing - supported IoT environment.