Online Computation Offloading and Resource Scheduling in Mobile-Edge Computing

Tong Liu,Yameng Zhang,Yanmin Zhu,Weiqin Tong,Yuanyuan Yang
DOI: https://doi.org/10.1109/jiot.2021.3051427
IF: 10.6
2021-04-15
IEEE Internet of Things Journal
Abstract:With the explosion of mobile smart devices, many computation intensive applications have emerged, such as interactive gaming and augmented reality. Mobile-edge computing (EC) is put forward, as an extension of cloud computing, to meet the low-latency requirements of the applications. In this article, we consider an EC system built in an ultradense network with numerous base stations. Heterogeneous computation tasks are successively generated on a smart device moving in the network. An optimal task offloading strategy, as well as optimal CPU frequency and transmit power scheduling, is desired by the device user to minimize both task completion latency and energy consumption in a long term. However, due to the stochastic task generation and dynamic network conditions, the problem is particularly difficult to solve. Inspired by reinforcement learning, we transform the problem into a Markov decision process. Then, we propose an attention-based double deep $Q$ network (DDQN) approach, in which two neural networks are employed to estimate the cumulative latency and energy rewards achieved by each action. Moreover, a context-aware attention mechanism is designed to adaptively assign different weights to the values of each action. We also conduct extensive simulations to compare the performance of our proposed approach with several heuristic and DDQN-based baselines.
computer science, information systems,telecommunications,engineering, electrical & electronic
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to perform online computation offloading and resource scheduling in the Mobile - Edge Computing (EC) scenario to minimize task completion latency and device energy consumption. Specifically, the paper focuses on how a mobile device user with continuously generated computation tasks in an ultra - dense network environment can make real - time computation offloading decisions (i.e., execute tasks locally or offload them to the connected base station) and resource scheduling decisions (such as CPU frequency and transmission power adjustment) to minimize task completion latency and energy consumption over a long period. ### Background and Challenges of the Paper With the proliferation of mobile intelligent devices, many computationally - intensive applications such as interactive games and augmented reality have emerged. These applications have high requirements for low latency, but mobile devices are difficult to meet these requirements due to limited computing resources. For this reason, mobile edge computing has been proposed as a technology to extend cloud computing, aiming to provide computing services at the edge of the core network, close to mobile devices to reduce latency. However, due to the random movement of users, the random generation of computation tasks, and the dynamic changes of network conditions, the computation offloading and resource scheduling problems become very complex. ### Main Contributions 1. **Problem Modeling**: By modeling different computation modes, the online computation offloading and resource scheduling problem is formalized as a multi - objective optimization problem. The heterogeneity of tasks, the dynamic changes of network conditions, and the handover latency caused by service migration are considered, and it is strictly proved that this problem is NP - complete. 2. **Markov Decision Process (MDP)**: The problem is re - formulated as a multi - objective MDP, and the corresponding state space, action space, and reward function are defined, in which both latency rewards and energy rewards are considered. 3. **Dual - Deep Q - Network (DDQN) Method Based on Attention Mechanism**: A novel DDQN method based on attention mechanism is proposed to solve the multi - objective optimization problem. This method uses two deep neural networks to estimate the action - value functions that maximize latency and energy rewards, and designs a context - aware attention mechanism to adaptively allocate weights to different action - values. 4. **Performance Evaluation**: The effectiveness of the proposed method is verified through extensive simulation experiments. The results show that this method performs well in balancing task completion latency and energy consumption and has a fast convergence speed. ### Key Technologies of the Solution - **Multi - objective Optimization**: Consider two objectives, task completion latency and energy consumption, and find Pareto - optimal solutions. - **Reinforcement Learning**: Utilize the reinforcement learning framework, especially MDP, to handle decision - making problems in dynamic environments. - **Attention Mechanism**: Introduce an attention mechanism to adaptively adjust the importance of different actions and improve the flexibility and accuracy of decision - making. ### Conclusion The paper proposes a method based on deep reinforcement learning, which effectively solves the online computation offloading and resource scheduling problems in mobile edge computing, can make real - time decisions in dynamic environments, and simultaneously minimizes task completion latency and energy consumption.