Abstract:The emergence of Mobile Edge Computing (MEC) alleviates the large transmission latency resulting from the traditional cloud computing. For the compute-intensive requests such as video analysis, mobile users prefer to obtain a desired quality of experience (QoE) with neglected latency and reduced energy consumption. The popularity of smart devices allows users to release a run of compute-intensive as well as latency-sensitive requests anywhere, which may lead to bursty requests. A single resource-constrained edge server nearby is capable of handling a small amount of requests quickly, yet it seems helpless when encountering bursty compute-intensive requests. Despite the abundance of recently proposed schemes, the majority focus on efficiently scheduling pending requests in a single edge server, and ignored the potential role of edge collaboration to schedule bursty requests. Besides, while some recent studies proposed to finish a task using multiple devices, they focused on collaboration between mobile devices rather than between edge servers. Hence, we propose DeepLoad, a S2S system that schedules the bursty requests with a collaborative method using reinforcement learning (RL). DeepLoad decouples the scheduling decision into AP selection for setting the access point and workload redistribution for collaborative servers. DeepLoad trains a neural network model that picks decisions for each request based on observations collected by mobile devices. DeepLoad learns to make scheduling decisions solely through the resulting performance of historical decisions rather than rely on pre-programmed models or specific assumptions for the environment. Naturally, DeepLoad automatically learns the scheduling algorithm for each request and obtains a gratifying QoE. We aim to maximize the fraction of requests finished before their attached deadlines. Based on the Shanghai taxi trajectory data set, we design a simulator to obtain abundant samples, and leverage two GeForce GTX TITAN Xp GPUs to train the Actor–Critic network. Compared to the state-of-the-art bandwidth-based and server resources-based methods, DeepLoad can achieve a significant improvement in average fraction.

Preemptive Scheduling for Distributed Machine Learning Jobs in Edge-Cloud Networks

Online Scheduling of Machine Learning Jobs in Edge-Cloud Networks

AI-oriented Workload Allocation for Cloud-Edge Computing.

Online Scheduling Algorithm for Heterogeneous Distributed Machine Learning Jobs

DPS: Dynamic Pricing and Scheduling for Distributed Machine Learning Jobs in Edge-Cloud Networks

Reinforcement Learning Based Online Scheduling of Multiple Workflows in Edge Environment

Online Job Scheduling in Distributed Machine Learning Clusters

Online Job Dispatching and Scheduling in Edge-Clouds

Computational-Intelligence-Based Scheduling with Edge Computing in Cyber–Physical Production Systems

OnDisc: Online Latency-Sensitive Job Dispatching and Scheduling in Heterogeneous Edge-Clouds

Adaptive Pricing and Online Scheduling for Distributed Machine Learning Jobs

Dynamic Parallel Multi-Server Selection and Allocation in Collaborative Edge Computing

PREMA: A Predictive Multi-task Scheduling Algorithm For Preemptible Neural Processing Units

Online Approximation Scheme for Scheduling Heterogeneous Utility Jobs in Edge Computing

MCDS: AI Augmented Workflow Scheduling in Mobile Edge Cloud Computing Systems

Decentralized Scheduling for Concurrent Tasks in Mobile Edge Computing Via Deep Reinforcement Learning

Low-latency job scheduling with preemption for the development of deep learning

Learning Scheduling Bursty Requests in Mobile Edge Computing Using DeepLoad

Distributed Flexible Job Shop Scheduling through Deploying Fog and Edge Computing in Smart Factories Using Dual Deep Q Networks

Joint Task Partitioning and Parallel Scheduling in Device-Assisted Mobile Edge Networks

Cooperative Job Dispatching in Edge Computing Network with Unpredictable Uploading Delay