Abstract: In this work, a <italic>Hybrid Hierarchical Federated Edge Learning</italic> (HHFEL) architecture, which consists of a device layer, an edge layer, and a cloud layer over heterogeneous networks, is investigated for large-scale model training. In such systems, learning efficiency is severely degraded by limited communication resources and by device heterogeneity in local data distribution and computation capability, especially for synchronous federated learning (FL) mechanisms, where each training round must wait for the slowest device. To tackle this issue, asynchronous FL has been proposed, which allows devices with powerful computation and communication capabilities to exchange information with the server more frequently. However, asynchronous FL faces a new challenge: low accuracy caused by imbalanced local model updates. To overcome the shortcomings of both synchronous and asynchronous FL, we propose an enhanced online semi-asynchronous FL mechanism between the edge and device layers, where each device trains its local model with newly generated data and each edge server aggregates a number of local models based on their arrival order in each round. In particular, devices with faster training speeds fully utilize their idle time by training their local models repeatedly. Meanwhile, synchronous FL with an edge elastic update strategy is adopted between the cloud and edge layers for personalized information exchange. Considering the continuous generation of data, we formulate the problem as an online <italic>Markov Decision Process</italic> (MDP) to realize communication- and computation-efficient HHFEL via joint device selection and resource allocation. Because the problem is non-convex and combinatorial, we develop a hybrid <italic>Deep Q-Network</italic> (DQN) and <italic>Deep Deterministic Policy Gradient</italic> (DDPG) approach with low computational complexity to adapt the device selection and resource allocation strategies.
Numerical results show the effectiveness of the proposed mechanism compared with existing benchmarks.
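
The semi-asynchronous step described above, in which an edge server aggregates a number of local models in order of arrival, can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function name `semi_async_round`, the tuple layout of `arrivals`, and the sample-count weighting (a FedAvg-style rule) are assumptions made for illustration, since the abstract does not specify the aggregation rule.

```python
import heapq


def semi_async_round(arrivals, k):
    """Aggregate the first k local models to arrive in one round.

    arrivals: list of (arrival_time, device_id, weights, num_samples),
              where weights is a list of floats (hypothetical layout).
    Returns (aggregated_weights, selected_device_ids).
    """
    # Keep the k earliest arrivals; later models are left out of this round.
    first_k = heapq.nsmallest(k, arrivals, key=lambda a: a[0])
    total = sum(n for _, _, _, n in first_k)
    dim = len(first_k[0][2])
    # Assumed rule: average weighted by each device's sample count.
    agg = [sum(w[i] * n for _, _, w, n in first_k) / total for i in range(dim)]
    return agg, [d for _, d, _, _ in first_k]
```

With three devices and `k = 2`, the slowest device's model would simply be excluded from that round's aggregate rather than delaying it, which is the trade-off between the synchronous and asynchronous extremes that the abstract describes.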

CHEESE: Distributed Clustering-Based Hybrid Federated Split Learning over Edge Networks

Hierarchical Federated Learning with Adaptive Clustering on Non-IID Data

Enhanced Hybrid Hierarchical Federated Edge Learning Over Heterogeneous Networks

Split Federated Learning Over Heterogeneous Edge Devices: Algorithm and Optimization

A Joint Gradient and Loss Based Clustered Federated Learning Design

Accelerating Hierarchical Federated Learning with Model Splitting in Edge Computing

Accelerating Federated Learning with Cluster Construction and Hierarchical Aggregation

Faster Convergence on Heterogeneous Federated Edge Learning: An Adaptive Clustered Data Sharing Approach

ESFL: Efficient Split Federated Learning over Resource-Constrained Heterogeneous Wireless Devices

Effectively Heterogeneous Federated Learning: A Pairing and Split Learning Based Approach

Enhancing Edge-Assisted Federated Learning with Asynchronous Aggregation and Cluster Pairing

Federated Split Learning for Edge Intelligence in Resource-Constrained Wireless Networks

Low-Latency Hierarchical Federated Learning in Wireless Edge Networks

Accelerating Decentralized Federated Learning in Heterogeneous Edge Computing

Semi-Decentralized Federated Edge Learning with Data and Device Heterogeneity

Semi-Decentralized Federated Edge Learning for Fast Convergence on Non-IID Data

Stochastic Clustered Federated Learning

Cooperative Model Dissemination Strategy for Hierarchical Clustering Learning in Edge Computing

Split Federated Learning: Speed up Model Training in Resource-Limited Wireless Networks

FedLite: A Scalable Approach for Federated Learning on Resource-constrained Clients

Federated Split Learning for Distributed Intelligence with Resource-Constrained Devices