Abstract:In this work, a <italic>Hybrid Hierarchical Federated Edge Learning</italic> (HHFEL) architecture that consists of a device layer, an edge layer, and a cloud layer over heterogeneous networks, is investigated for large-scale model training. In such systems, learning efficiency is severely degraded by limited communication resources and device heterogeneity in terms of local data distribution and computation capability, especially for synchronous FL mechanisms where the training of each round should wait for the slowest device. To tackle this issue, asynchronous FL is proposed, which allows the devices with powerful computation and communication capabilities exchanging information with the server more frequently. However, this asynchronous FL framework faces a new challenge of low accuracy caused by the imbalanced local model updating. To overcome the shortage of both synchronous and asynchronous FLs, we propose an enhanced online semi-asynchronous FL mechanism between the edge-device layers, where each device trains its local model with the newly generated data and each edge server aggregates a number of local models based on their arrival order in each round. Particularly, devices with faster training speeds would fully utilize the idle time by training their local models repetitively. Meanwhile, synchronous FL with an edge elastic update strategy is adopted to the cloud-edge layers for personalized information exchange. Considering the continuous data generation feature, we formulate the objective problem as an online <italic>Markov Decision Process</italic> (MDP) to realize efficient communication-and-computing HHFEL via joint device selection and resource allocation. Due to the non-convex and combinatorial problem structure, we develop a hybrid <italic>Deep Q-Network</italic> (DQN) and <italic>Deep Deterministic Policy Gradient</italic> (DDPG) approach with low computational complexity to adapt the device selection and resource allocation strategies. Numerical results show the effectiveness of the proposed mechanism compared with existing benchmarks.

Towards Efficient Edge Learning for Large Models in Heterogeneous Resource-limited Environments.

Extendable Multi-Device Collaborative Pipeline Parallel Inference in the Edge-Cloud Scenario

Enhanced Hybrid Hierarchical Federated Edge Learning Over Heterogeneous Networks

AccEPT: an Acceleration Scheme for Speeding Up Edge Pipeline-parallel Training

Implementation of Big AI Models for Wireless Networks with Collaborative Edge Computing

Resource-efficient Parallel Split Learning in Heterogeneous Edge Computing

Learning-efficient Transmission Scheduling for Distributed Knowledge-aware Edge Learning.

Collaborative Inference for Large Models with Task Offloading and Early Exiting

An Efficient Asynchronous Federated Learning Protocol for Edge Devices

Towards Efficient Model-Heterogeneity Federated Learning for Large Models

ECLM: Efficient Edge-Cloud Collaborative Learning with Continuous Environment Adaptation

Deep-Edge: An Efficient Framework for Deep Learning Model Update on Heterogeneous Edge

Automated Exploration and Implementation of Distributed CNN Inference at the Edge

AdaptiveNet: Post-deployment Neural Architecture Adaptation for Diverse Edge Environments

Energy-Efficient Split Learning for Fine-Tuning Large Language Models in Edge Networks

ED-ViT: Splitting Vision Transformer for Distributed Inference on Edge Devices

Cost-Efficient Federated Learning for Edge Intelligence in Multi-Cell Networks

Towards Efficient Asynchronous Federated Learning in Heterogeneous Edge Environments

Efficient federated learning on resource-constrained edge devices based on model pruning

Accelerating DNN Training in Wireless Federated Edge Learning Systems

Adaptive Federated Learning in Resource Constrained Edge Computing Systems