Xiong Wang,Jiancheng Ye,John C.S. Lui
Abstract:Mobile edge computing facilitates users to offload computation tasks to edge servers for meeting their stringent delay requirements. Previous works mainly explore task offloading when system-side information is given (e.g., server processing speed, cellular data rate), or centralized offloading under system uncertainty. But both generally fall short to handle task placement involving many coexisting users in a dynamic and uncertain environment. In this paper, we develop a multi-user offloading framework considering unknown yet stochastic system-side information to enable a decentralized user-initiated service placement. Specifically, we formulate the dynamic task placement as an online multi-user multi-armed bandit process, and propose a decentralized epoch based offloading (DEBO) to optimize user rewards which are subjected under network delay. We show that DEBO can deduce the optimal user-server assignment, thereby achieving a close-to-optimal service performance and tight O(log T) offloading regret. Moreover, we generalize DEBO to various common scenarios such as unknown reward gap, dynamic entering or leaving of clients, and fair reward distribution, while further exploring when users' offloaded tasks require heterogeneous computing resources. Particularly, we accomplish a sub-linear regret for each of these instances. Real measurements based evaluations corroborate the superiority of our offloading schemes over state-of-the-art approaches in optimizing delay-sensitive rewards.
What problem does this paper attempt to address?
This paper attempts to solve the problem of how to make effective task offloading decisions in the presence of multiple users in the Mobile Edge Computing (MEC) environment. Specifically, the paper focuses on how to achieve decentralized user - initiated service placement in a dynamic and uncertain environment when system - side information (such as server processing speed, transmission data rate, etc.) is unknown, in order to meet users' strict latency requirements and enhance the server's computing services.
### Main problems solved in the paper:
1. **Adaptive offloading under unknown system information**: Since system - side information is usually not available to mobile users, a learning - based adaptive offloading method needs to be designed to optimize user rewards without complete system information.
2. **Fully decentralized service placement**: In large - scale MEC systems, users may not be aware of each other's existence, so a fully decentralized scheme without user - to - user communication needs to be designed.
3. **Handling of heterogeneous user requirements**: Different users have different sensitivities to latency, and their latency - sensitive rewards need to be balanced to ensure fair edge resource allocation.
4. **Management of limited computing capacity**: Edge servers have limited computing capabilities, and good theoretical performance needs to be achieved while respecting capacity limitations.
### Solutions:
The paper proposes a fully decentralized multi - user offloading scheme (DEBO), which solves the above problems by modeling the dynamic task placement as an online multi - user multi - armed bandit (MAB) process. The specific steps are as follows:
1. **Time division**: Divide the total time into multiple epochs, each epoch includes an exploration phase, a matching phase, and an exploitation phase.
- **Exploration phase**: Users randomly select edge servers for task offloading to obtain estimated rewards.
- **Matching phase**: Based on the estimated rewards, users determine the optimal user - server allocation through a decentralized auction mechanism (DAuction).
- **Exploitation phase**: Users perform task offloading according to the results of the matching phase and fully utilize the corresponding rewards.
2. **Decentralized auction mechanism (DAuction)**: Through the decentralized auction mechanism, users can bid for edge server resources based on the estimated reward values, and finally achieve the optimal user - server allocation.
3. **Performance analysis**: The paper proves that the upper bound of the offloading regret of the DEBO scheme is \(O(\log^2 T)\), and it still has good performance in various extended scenarios (such as unknown reward gaps, dynamic user entry and exit, fair reward distribution, etc.).
### Main contributions:
- Propose a fully decentralized multi - user offloading framework that can achieve optimal performance in a dynamic and uncertain MEC environment without any user - to - user communication or system - side information.
- Design the DEBO scheme, which achieves a tight \(O(\log T)\) offloading regret through time division and a decentralized auction mechanism.
- Extend the DEBO scheme to be applicable to various common scenarios, such as unknown reward gaps, dynamic user entry and exit, fair reward distribution, etc., and quantify the sub - linear regret in these extended scenarios.
- Propose the H - DEBO scheme to handle users' heterogeneous offloading requirements and ensure good service performance even without optimal allocation.
Through these contributions, the paper provides an effective and robust solution for multi - user task offloading in the MEC environment.