Abstract:IoT has recently witnessed a boom in AI deployment at the edge as a result of the newly developed small size Machine Learning (ML) models and integrated hardware accelerators. Although it brings huge benefits such as privacy-preserving and low-latency applications, it still suffers from typical resource limitations of edge devices. A new approach aims to deploy multiple inference models varying in size and accuracy onboard the edge device which could alleviate some of these limitations. This dynamic system can be leveraged to provide real-time energy efficient application by smartly allocating inference tasks to inference local models or offload to edge servers based on current constraints. In this work, we tackle the problem of efficiently allocating inference models for a given set of inference tasks between local inference models and edge server models in parallel under given time and energy constraints. This problem is considered strongly NP-hard and therefore we propose LITOSS, a 2-stage framework in which we use a lightweight Genetic Algorithm-based schemer for task scheduling along with a Reinforcement Learning (RL) agent for improving edge server selection. We perform experiments using a raspberry pi with a set of edge servers. Results show that our framework performed relatively faster compared to other meta-heuristic schemes such as LGSTO, Ant Colony Optimization (ACO) and Particle Swarm Optimization (PSO) while providing higher average accuracy. We also show that using an RL agent to select the best subset of available edge servers increased, or maintained in worst cases, the average accuracy while reducing the average scheduling times.

Task offloading and resource allocation algorithm based on deep reinforcement learning for distributed AI execution tasks in IoT edge computing environments

A Meta Reinforcement Learning-Based Task Offloading Strategy for IoT Devices in an Edge Cloud Computing Environment

Deep Reinforcement Learning for Task Offloading in Edge Computing Assisted Power IoT

An Advanced Deep Reinforcement Learning Algorithm for Three-layer D2D-Edge-Cloud Computing Architecture for Efficient Task Offloading in Internet of Things

Task Offloading Based on LSTM Prediction and Deep Reinforcement Learning for Efficient Edge Computing in IoT

Energy-Aware Selective Inference Task Offloading for Real-Time Edge Computing Applications

Joint DNN Partition and Resource Allocation for Task Offloading in Edge-Cloud-Assisted IoT Environments

A Density-Based Offloading Strategy for IoT Devices in Edge Computing Systems

Computation Offloading in Resource-constrained Edge Computing Systems based on Deep Reinforcement Learning

Deep Reinforcement Learning-Based Computation Offloading and Resource Allocation in IoT

A computational offloading optimization scheme based on deep reinforcement learning in perceptual network

Resource Allocation with Edge Computing in IoT Networks Via Machine Learning

DRL-Based Distributed Task Offloading Framework in Edge-Cloud Environment

Joint Optimization With DNN Partitioning and Resource Allocation in Mobile Edge Computing

Cloud-Edge Collaboration in Industrial Internet of Things: A Joint Offloading Scheme based on Resource Prediction

A novel approach for IoT tasks offloading in edge-cloud environments

Energy-Efficient DNN Partitioning and Offloading for Task Completion Rate Maximization in Multiuser Edge Intelligence

Distributed Inference in Resource-Constrained IoT for Real-Time Video Surveillance

IoT workload offloading efficient intelligent transport system in federated ACNN integrated cooperated edge-cloud networks

Advanced Deep Learning for Resource Allocation and Security Aware Data Offloading in Industrial Mobile Edge Computing